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Description 

Background of the Invention 



[0001] Cell differentiation is the central characteristic of morphogenesis which initiates in the embryo, and continues 
to various degrees throughout the life of an organism in adult tissue repair and regeneration mechanisms. The degree 
of morphogenesis in adult tissue varies among different tissues and is related, among other things, to the degree of 
cell turnover in a given tissue. On this basis, tissues can be divided into three broad categories: (1 ) tissues with static 
cell populations such as nerve and skeletal muscle where there is no cell division and most of the cells formed during 
early development persist throughout adult life; (2) tissues containing conditionally renewing populations such as liver 
where there is generally little cell division but, in response to an appropriate stimulus, cells can divide to produce 
daughters of the same differentially defined type; and (3) tissues with permanently renewing populations including 
blood, testes and stratified squamous epithelia which are characterized by rapid and continuous cell turnover in the 
adult. Here, the terminally differentiated cells have a relatively short life span and are replaced through proliferation of 
a distinct subpopulation of cells, known as stem or progenitor cells. 

[0002] The cellular and molecular events which govern the stimulus for differentiation of these cells is an area of 
intensive research. In the medical field, it is anticipated that the discovery of factor(s) which control cell differentiation 
and tissue morphogenesis will significantly advance medicine's ability to repair and regenerate diseased or damaged 
mammalian tissues and organs. Particularly useful areas include reconstructive surgery and in the treatment of tissue 
degenerative diseases including arthritis, emphysema, osteoporosis, cardiomyopathy, cirrhosis, and degenerative 
nerve diseases. 

[0003] A number of different factors have been isolated in recent years which appear to play a role in cell differenti- 
ation. Some of these factors are gene transcription activators such as the NOTCH gene, identified in Drosophila and 
the related XOTCH gene identified in Xenopus, as well as a number of transcription activators identified in Caenorhab- 
ditis elegans. " 

[0004] The hemopoietic system, because of its continually renewing cell population, is an area of concentrated study. 
Factors identified in this system which may be involved in cell renewal include interleukin 3 (IL-3), erythropoietin, the 
CSFs (GM-CSF, G-CSF, M-CSF et al.) and various stem cell growth factors. 

[0005] Other proteins thought to play a role in cell differentiation include proteins that are members of the family of 
insulin-like growth factors (IGF), members of the family of heparin-binding growth factors, (e.g., FGF - acidic and basic 
fibroblast growth factors, and ECDGF - embryonal carcinoma-derived growth factor) as well as several transforming 
oncogenes (hst and int-2, see for example, Heath et al., (1988), J. Cell Sci. Suppl, 10:256-256.) DIF (Differentiation 
Inducing Factor), identified in Dictyostelium discoideum, is another bioregulatory protein, directing prestock cell differ- 
entiation in that organism. 

[0006] The structurally related proteins of the TGF-p superfamily of proteins also have been identified as involved 
in a variety of developmental events. For example, TGF-p and the polypeptides of the inhibin/activin group appear to 
play a role in the regulation of cell growth and differentiation. MIS (Mullerian Inhibiting Substance) causes regression 
of the Mullerian duct in development of the mammalian male embryo, and DPP, the gene product of the Drosophila 
decapentaplegic complex is required for appropriate dorsal-ventral specification. Similarly, Vg-1 is involved in meso- 
derm induction in Xenopus, and Vgr-1 has been identified in a variety of developing murine tissues. 

[0007] Another source that has revealed a wealth of information is in the area of bone morphogenesis. The devel- 
opment and study of a bone model system has identified the developmental cascade of bone differentiation as con- 
sisting of chemotaxis of mesenchymal cells, proliferation of these progenitor cells, differentiation of these cells into 
chrondroblasts, cartilage calcification, vascular invasion, bone formation, remodeling, and finally, marrow differentiation 
(Reddi (1 981 ) Collagen Rel. Res. 1 :209-206). Proteins capable of inducing endochondral bone formation in a mammal 
when implanted in association with a matrix now have been identified in a number of different mammalian species, as 
have the genes encoding these proteins, (see, for example, U.S. Patent No. 4,968,590 and U.S. Patent No. 5,011,691 , 
Ozkaynak, et al., (1990) EMBO J 9:2085-2093, and Ozkaynak et al., (1991) Biochem. Biophys. Res. Commn. 179 : 
116-123. WO-A-89 09788, WO-A-89 09787, WO-A-90 03733, WO-A-91 05802 and WO-A-91 18558. These proteins, 
which share significant amino acid sequence homology with one another as well as structural similarities with various 
members of the TGF-p super family of proteins, have been shown to induce endochondral bone formation and/or 
cartilage formation when implanted, in a mammal in association with a suitably modified matrix. Proteins capable of 
inducing a similar developmental cascade of tissue morphogenesis of other tissues have not been identified. 

[0008] It is an object of this invention to provide morphogenic proteins ("morphogens"), and methods for identifying 
these proteins, which are capable of inducing the developmental cascade of tissue morphogenesis for a variety of 
tissues in mammals different from bone or cartilage. This morphogenic activity includes the ability to induce proliferation 
and differentiation of progenitor cells, and the ability to support and maintain the differentiated phenotype through the 
progression of events that results in the formation of adult tissue. Another object is to provide genes encoding these 
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proteins as well as methods for the expression and isolation of these proteins, from either natural sources or biosynthetic 
sources, using recombinant DNA techniques. Still another object is to provide tissue-specific acellular matrices that 
may be used in combination with these proteins, and methods for their production. 

Other objects include providing methods for increasing a progenitor cell population in a mammal, methods for stimu- 
5 lating progenitor cells to differentiate in vivo or in vitro and maintain their differentiated phenotype, methods for inducing 
tissue-specific growth in vivo and methods for the replacement of diseased or damaged tissue in vivo. These and other 
objects and features of the invention will be apparent from the description, drawings, and claims which follow. 

Summary or the Invention 

10 

[0009] This invention is defined by the claims appended hereto. 

[0010] Thus, described herein are morphogenic proteins (“morphogens") capable of inducing the. developmental 
cascade of tissue-specific non skeletal morphogenesis in a mammal. In particular, these proteins are capable of in- 
ducing the proliferation of uncommitted progenitor cells, and inducing the differentiation of these stimulated progenitor 
15 cells in a tissue-specific manner under appropriate environmental conditions. In addition, the morphogens are capable 
of supporting the growth and maintenance of these differentiated cells. These morphogenic activities allow the proteins 
to initiate and maintain the developmental cascade of tissue morphogenesis in an appropriate, morphogenically per- 
missive environment, stimulating stem cells to proliferate and differentiate in a tissue-specific manner, and inducing 
the progression of events that culminate in new tissue formation. These morphogenic activities also allow the proteins 
20 to stimulate the u redifferentiation“ of cells previously induced to stray from their differentiation path. Under appropriate 
environmental conditions it is anticipated that these morphogens also may stimulate the "dedifferentiation" of committed 
cells (see infra.) 

[0011] As described herein, the proteins and compositions are useful in the replacement of diseased or damaged 
tissue in a mammal, particularly when the damaged tissue interferes with normal tissue or organ function. Accoraingly, 
25 it is anticipated that the proteins will be useful in the repair of damaged tissue such as, for example, damaged lung 
tissue resulting from emphysema, cirrhotic kidney or liver tissue, damaged heart or blood vessel tissue, as may result 
from cardiomyopathies and/or atherothrombotic or cardioembolic strokes, damaged stomach tissue resulting from ul- 
ceric perforations or their repair, damaged neural tissue as may result from physical injury, degenerative diseases such 
as Alzheimer's disease or multiple sclerosis or strokes, damaged dentin tissu as may result from disease or mechanical 
30 injury. When the proteins are provided to, or their expression stimulated at, a tissue-specific locus, the developmental 
cascade of tissue morphogenesis is induced (see infra). Cells stimulated ex vivo by contact with the proteins or agents 
capable of stimulating morphogen expression in these cells also may be provided to the tissue locus. In these cases 
the existing tissue provides the necessary matrix requirements, providing a suitable substratum for the proliferating 
and differentiating cells in a morphogenically permissive environment, as well as providing the necessary signals for 
35 directing the tissue-specificity of the developing tissue. 

Alternatively, the proteins or stimulated ceils may be combined with a formulated matrix and implanted as a device at 
a locus in vivo. The formulated matrix should be a biocompatible, preferably biodegradable, appropriately modified 
tissue-specific acellular matrix having the characteristics described below. 

[0012] In many instances, the loss of tissue function results from scar tissue, formed in response to an initial or 
repeated injury to the tissue. The degree of scar tissue formation generally depends on the regenerative properties of 
the injured tissue, and on the degree and type of injury. Thus, described herein are morphogens that may be used to 
prevent or substantially inhibit the formation of scar tissue by providing the morphogens, or morphogen-stimulated 
cells, to a newly injured tissue loci (see infra). 

[001 3] The morphogens also may be used to increase or regenerate a progenitor or stem cell population in a mammal. 
45 For example, progenitor cells may be isolated from an individual's bone marrow, stimulated ex vivo for a time and at 
a morphogen concentration sufficient to induce the cells to proliferate, and returned to the bone marrow. Other sources 
of progenitor cells that may be suitable include biocompatible cells obtained from a cultured cell line, stimulated in 
culture, and subsequently provided to the body. Alternatively, the morphogen may be provided systemically, or implant- 
ed, injected or otherwise provided to a progenitor cell population in an individual to induce its mitogenic activity in vivo. 
50 For example, an agent capable of stimulating morphogen expression in the progenitor cell population of interest may 
be provided to the cells in vivo , for example systemically, to induce mitogenic activity. Similarly, a particular population 
of hemopoietic stem cells may be increased by the morphogens, for example by perfusing an individual’s blood to 
extract the cells of interest, stimulating these cells ex vivo, and returning the stimulated cells to the blood. It is anticipated 
that the ability to augment an individual's progenitor cell population will significantly enhance existing methods for 
55 treating disorders resulting from a loss or reduction of a renewable cell population. Two particularly significant appli- 
cations include the treatment of blood disorders and impaired or lost immune function. Other cell populations whose 
proliferation may be exploited include the stem cells of the epidermis, which may be used in skin tissue regeneration, 
and the stem cells of the gastrointestinal lining, for example, in the healing of ulcers. 
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[0014] As described herein, the morphogens also may be used to support the growth and maintenance of differen- 
tiated ceils, inducing existing differentiated cells to continue expressing their phenotype. It is anticipated that this activity 
will be particularly useful in the treatment of tissue disorders where loss of function is caused by cells becoming se- 
nescent or quiescent, such as may occur in osteoporosis. Application of the protein directly to the cells to be treated, 
or providing it by systemic injection, can be used to stimulate these cells to continue expressing their phenotype, 
thereby significantly reversing the effects of the dysfunction (see infra). In addition, the morphogens also may be used 
in gene therapy protocols to stimulate the growth of quiescent cells, thereby potentially enhancing the ability of these 
cells to incorporate exogenous DNA. 

[001 5] As described herein, the morphogens also may be used to induce "redifferentiation" of cells that have strayed 
from their differentiation pathway, such as can occur during tumorgenesis. It is anticipated that this activity of the 
proteins will be particularly useful in treatments to reduce or substantially inhibit the growth of neoplasms. The method 
also is anticipated to induce the de-and re-differentiation of these cells. As described supra, the proteins may be pro- 
vided to the cells directly or systemically, or an agent capable of stimulating morphogen expression in vivo may be 
provided. 

[0016] Finally, modulations of endogenous morphogen levels may be monitored as part of a method for detecting 
non-skeletal tissue dysfunction. Specifically, modulations in endogenous morphogen levels are anticipated to reflect 
changes in tissue or organ stasis. Tissue stasis may be monitored by detecting changes in the levels of the morphogen 
itself. For example, tissue samples may be obtained at intervals and the concentration of the morphogen present in . 
the tissue detected by standard protein detection means known to those skilled in the art. As an example, a binding 
protein capable of interacting specifically with the morphogen of interest, such as an anti-morphogen antibody, may 
be used to detect the morphogen in a standard immunoassay. The morphogen levels detected then may be compared, 
the changes in the detected levels being indicative of the status of the tissue. Modulations in endogenous morphogen 
levels also may be monitored by detecting fluctuations in the body’s natural antibody titer to morphogens (see infra.) 
[0017] The morphogenic proteins and compositions described herein can be isolated from a variety of naturally- 
occurring sources, or they may be constructed biosynthetically using conventional recombinant DNA technology. Sim- 
ilarly, the matrices may be derived from organ-specific tissue, or they may be formulated synthetically, as described 
below. 

[0018] A key to these developments was the discovery and characterization of naturally-occurring osteogenic pro- 
teins followed by observation of their remarkable properties. These proteins, originally isolated from bone, are capable 
of inducing the full developmental cascade of bone formation, including vascularization, mineralization, and bone mar- 
row differentiation, when implanted in a mammalian body in association with a suitably modified matrix. Native proteins 
capable of inducing this developmental cascade, as well as DNA sequences encoding these proteins now have been 
isolated and characterized for a number of different species (e.g., human and mouse OP-1 , OP-2, and CBMP-2. See, 
for example, U.S. Patent Nos. 4,968,590 and 5,011,691; Sampath et al. (1990) J. Bio. Chem 265 :13198-13205; Oz- 
kaynak, et al. (1 990) EMBO J 9 :2085-2 093 and Ozkaynak, et al. (1 991 ) Biochem. Biophys. Res. Commn. 179 ; 116-1 23.) 
The mature forms of these proteins share substantial amino acid sequence homology, especially in the C-terminal 
regions of the mature proteins. In particular, the proteins share a conserved six or seven cysteine skeleton in this region 
(e.g., the linear arrangement of these C-terminal cysteine residues is essentially conserved in the different proteins, 
in addition to other, apparently required amino acids (see Table II, infra)). 

[0019] Polypeptide chains not normally associated with bone or bone formation, but sharing substantial amino acid 
sequence homology with the C-terminus of the osieogenic proteins, including the conserved six or seven cysteine 
skeleton, also have been identified as competent for inducing bone in mammals. Among these are amino acid se- 
quences identified in Drosophila and Xenopus, (e.g., DPP and Vgl; see, for example, U.S. Patent No. 5,011,691 and 
Table II, infra). In addition, non-native biosynthetic constructs designed based on extrapolation from these sequence 
homologies, including the conserved six or seven cysteine skeleton, have been shown to induce endochondral bone 
formation in mammals when implanted in association with an appropriate matrix (see U.S. Pat. No. 5,011,691 and 
Table III, infra). 

[0020] It has now been discovered that this "family" of proteins sharing substantial amino acid sequence homology 
and the conserved six or seven cysteine skeleton are true morphogens, capable of inducing, in addition to bone and 
cartilage, tissue-specific morphogenesis for a variety of other organs and tissues. The proteins apparently bind to 
surface receptors or otherwise contact and interact with progenitor cells, predisposing or stimulating the cells to pro- 
liferate and differentiate in a morphogenically permissive environment. The morphogens are capable of inducing the 
developmental cascade of cellular and molecular events that culminate in the formation of new organ-specific tissue, 
including any vascularization, connective tissue formation, and nerve ennervation as required by the naturally occurring 
tissue. 

[0021] The term "morphogen" as used herein, refers to the aforementioned osteogenic proteing and polypeptide 
chains sharing substantial amino acid sequence, homology with the C-terminus of said osteogenic proteins. 

[0022] It also has been discovered that the way in which the cells differentiate, whether, for example, they differentiate 
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into bone-producing osteoblasts, hemopoietic cells, or liver cells, depends on the nature of their local environment (see 
infra). Thus, in addition to requiring a suitable substratum on which to anchor, the proliferating and differentiating cells 
also require appropriate signals to direct their tissue-specificity. These signals may take the form of cell surface markers. 
[0023] When the morphogens (or progenitor cells stimulated by these morphogens) are provided at a tissue-specific 
s locus (e.g., by systemic injection or by implantation or injection at a tissue-specific locus), the existing tissue at that 
locus, whether diseased or damaged, has the capacity of acting as a suitable matrix. Alternatively, a formulated matrix 
may be externally provided together with the stimulated progenitor cells or morphogen, as may be necessary when 
the extent of injury sustained by the damaged tissue is large. The matrix should be a biocompatible, suitably modified 
acellular matrix having dimensions such that it allows the influx, differentiation, and proliferation of migratory progenitor 
10 cells, and is capable of providing a morphogenically permissive environment (see infra). The matrix preferably is tissue- 
specific, and biodegradable. 

[0024] Formulated matrices may be generated from dehydrated organ-specific tissue, prepared for example, by 
treating the tissue with solvents to substantially remove the non-structural components from the tissue. Alternatively, 
the matrix may be formulated synthetically using a biocompatible, preferably in vivo biodegradable, structural polymer 
15 such as collagen in association with suitable tissue-specific cell attachment factors. Currently preferred structural pol- 
ymers comprise tissue-specific collagens. Currently preferred cell attachment factors include glycosaminoglycans and 
proteoglycans. The matrix further may be treated with an agent or agents to increase the number of pores and micropits 
on its surfaces, so as to enhance the influx, proliferation and differentiation of migratory progenitor cells from the body 
of the mammal. 

20 [0025] Among the morphogens useful are proteins originally identified as osieogenic proteins, such as the OP-1, 

OP-2 and CBMP2 proteins, as well as amino acid sequence-related proteins such as DPP (from Drosophila), Vgl (from 
Xenopus), Vgr-1 (from mouse, see Table II and Seq. ID Nos.5-14), and the recently identified GDF-1 protein (Seq. ID 
No. 14). The members of this family, which include members of the TGF-p super-family of proteins, share substantial 
amino acid sequence homology in their C-terminal regions. Table I, below, describes the various morphogens identified 
25 to date, including their nomenclature as used herein, and Seq. ID references. 



TABLE I 



30 


M OP-1" 


Refers generically to the group of morphogenically active proteins expressed from part or all of a 
DNA sequence encoding OP-1 protein, including allelic and species variants thereof, e.g., human 




OP-1 ( n hOP-1“, Seq. ID No. 5, mature 

protein amino acid sequence), or mouse OP-1 (“mOP-1 ", Seq. ID No. 6, mature protein amino acid 
sequence.) The conserved seven cysteine skeleton is defined by residues 38 to 1 39 of Seq. ID Nos. 
5 and 6. The cDNA sequences and the amino acids encoding the full length proteins are provided 
in Seq. Id Nos. 16 and 17 (hOPI) and Seq. ID Nos. 18 and 19 (mOPI.) The mature proteins are 






defined by residues 293-431 (hOPI ) and 292-430 (mOPI ). The n pro"regions of the proteins, cleaved 
to yield the mature, morphogenically active proteins are defined essentially by residues 30-292 
(hOPI) and residues 30-291 (mOPI). 




"OP-2" 


refers generically to the group of active proteins expressed from part or all of a DNA sequence 
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encoding OP-2 protein, including allelic and species variants thereof, e.g., human OP-2 ("hOP-2“, 




Seq. ID No. 7, mature protein amino acid sequence) or mouse OP-2 ("mOP-2 M , Seq.. ID No. 8, mature 
protein amino acid sequence). The conserved seven cysteine skeleton is defined by residues 38 to 
139 of Seq. ID Nos. 7 and 8. The cDNA sequences and the amino acids encoding the full length 
proteins are provided in Seq. Id Nos. 20 and 21 (hOP2) and Seq. ID Nos. 22 and 23 (mOP2.) The 


45 




mature proteins are defined essentially by residues 264-402 (hOP2) and 261 -399 (mOP2). The “pro" 
regions of the proteins, cleaved to yield 

the mature, morphogenically active proteins are defined essentially by residues 18-263 (hOP2) and 
residues 18-260 (mOPI). 




“CBMP2” 


refers generically to the morphogenically active proteins expressed from a DNA sequence encoding 
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the CBMP2 proteins, including allelic and species variants thereof, e.g., human CBMP2A ("CBMP2A 
(fx)'\ Seq ID No. 9) or human CBMP2B DNA (“CBMP2B(fx) u , Seq. ID No. 10). 




’DPP(fx)" 


refers to protein sequences encoded by the Drosophila DPP gene and defining the conserved seven 
cysteine skeleton (seq. ID No. 11). 




■Vgl(fx)" 


refers to protein sequences encoded by the Xenopus Vgl gene and defining the conserved seven 
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cysteine skeleton (Seq. ID No. 12). 




”Vgr-1(fx)“ 


refers to protein sequences encoded by the murine Vgr-1 gene and defining the conserved seven 
cysteine skeleton (Seq. ID No. 13). 
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TABLE I (continued) 



"GDF-1 (fx)“ refers to protein sequences encoded by the human GDF-1 gene and defining the conserved seven 
cysteine skeleton (seq. ID No. 14). 



[0026] The OP-2 proteins have an additional cysteine residue in this region (e.g., see residue 41 of Seq. ID Nos. 7 
and 8), in addition to the conserved cysteine skeleton in common with the other proteins in this family. The GDF-1 
protein has a four amino acid insert within the conserved skeleton (residues 44-47 of Seq. ID No. 14) but this insert 
likely does not interfere with the relationship of the cysteines in the folded structure. In addition, the CBMP2 proteins 
are missing one amino acid residue within the cysteine skeleton. 

[0027] The morphogens are inactive when reduced, but are active as oxidized homodimers and when oxidized in 
combination with other morphogens of this invention. Thus, as defined herein, a morphogen of this invention is a dimeric 
protein comprising a pair of polypeptide chains, wherein each polypeptide chain comprises at least the C-terminal six 
cysteine skeleton defined by residues 43-139 of Seq. ID No. 5, including functionally equivalent arrangements of these 
cysteines (e.g., amino acid insertions or deletions which alter the linear arrangement of the cysteines in the sequence 
but not their relationship in the folded structure), such that, when the polypeptide chains are folded, the dimeric protein 
species comprising the pair of polypeptide chains has the appropriate three-dimensional structure, including the ap- 
propriate intra-or inter-chain disulfide bonds such that the protein is capable of acting as a morphogen as defined 
herein. Specifically, the protein is capable of all of the following biological functions in a morphogenically permissive 
environment: stimulating proliferation of progenitor cells; stimulating the differentiation of progenitor cells; stimulating 
the proliferation of differentiated cells; and supporting the growth and maintenance of differentiated cells, including the 
"redifferentiation" of these cells. In addition, it is also anticipated that the morphogens will be capable of inducing 
dedifferentiation of committed cells under appropriate environmental conditions. 

[0028] Preferably, the morphogens comprise one of two species of generic amino acid sequences: Generic Sequence 

1 (Seq. ID No. 1) or Generic Sequence 2 (Seq. ID No. 2); where each Xaa indicates one of the 20 naturally-occurring 
L-isomer, a-amino acids or a derivative thereof. Generic Sequence 1 comprises the conserved six cysteine skeleton 
and Generic Sequence 2 comprises the conserved six cysteine skeleton plus the additional cysteine identified in OP- 

2 (see residue 36, Seq. ID No. 2). In another preferred aspect, these sequences further comprise the following additional 
sequence at their N-terminus: 



Cys Xaa Xaa Xaa Xaa (Seq. ID No. 15) 

1 ' 5 

35 

[0029] Preferred amino acid sequences within the foregoing generic sequences include: Generic Sequence 3 (Seq. 
ID No. 3) and Generic Sequence 4 (Seq. ID No. 4), listed below, which accommodate the homologies shared among 
the various preferred members of this morphogen family identified to date (see Table II), as well as the amino acid 
40 sequence variation among them. Generic Sequences 3 and 4 are composite amino acid sequences of the proteins 
presented in Table II and identified in Seq. ID Nos. 5-14. The generic secuences include both the amino acid identity 
shared by the sequences in Table II, as well as alternative residues for the variable positions within the sequence. Note 
that these generic sequences allow for an additional cysteine at position 41 or 46 in Generic Sequences 3 or 4, re- 
spectively, providing an appropriate cysteine skeleton where inter- or intramolecular disulfide bonds can form, and 
45 contain certain critical amino acids which influence the tertiary structure of the proteins. 
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Generic Sequence 3 
Leu Tyr ValXaa Phe 

1 5 

Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

10 

Xaa Ala Pro Xaa Gly Xaa Xaa Ala 
15 20 

Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 
25 30 

Xaa Pro Xaa Xaa Xaa Xaa Xaa 

35 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 
40 45 

Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa 

50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 
55 60 

Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
65 



40 



45 



50 



55 



Xaa Xaa Xaa Leu Xaa Xaa xaa 
70 75 

Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
80 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 
85 90 

Xaa Cys Gly Cys Xaa 
95 
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wherein each Xaa is independently selected from a group of one or more specified amino acids defined as follows: 
"Res.” means "residue" and Xaa at res.4 = (Ser, Asp or Glu); Xaa at res.6 = (Arg, Gin, Ser or Lys); Xaa at res.7 = (Asp 
or Glu); Xaa at res.8 = (Leu or Val); Xaa at res. 11 = (Gin, Leu, Asp, His or Asn); Xaa at res.12 = (Asp, Arg or Asn); 
Xaa at res.14 = (lie or Val); Xaa at res.15 = (lie or Val); Xaa at res.18 = (Glu, Gin, Leu, Lys, Pro or Arg); Xaa at res.20 
5 = (Tyr or Phe); Xaa at res.21 = (Ala, Ser, Asp, Met, His, Leu or Gin); Xaa at res.23 = (Tyr, Asn or Phe); Xaa at res.26 

= (Glu, His, Tyr, Asp or Gin); Xaa at res.28 = (Glu, Lys, Asp or Gin); Xaa at res.30 = (Ala, Ser, Pro or Gin); Xaa at res. 
31 = (Phe, Leu or Tyr); Xaa at res.33 = (Leu or Val); Xaa at res.34 = (Asn, Asp, Ala or Thr); Xaa at res.35 = (Ser, Asp, 
Glu, Leu. or Ala); Xaa at res.36 = (Tyr, Cys, His, Ser or lie); Xaa at res.37 = (Met, Phe, Gly or Leu); Xaa at res.38 = 
(Asn or Ser); Xaa at res.39 = (Ala, Ser or Gly); Xaa at res.40 = (Thr, Leu or Ser); Xaa at res.44 = (He or Val); Xaa at 
io res.45 = (Val or Leu); Xaa at res.46 = (Gin or Arg); Xaa at res.47 = (Thr, Ala or Ser); Xaa at res.49 = (Val or Met); Xaa 
at res.50.= (His or Asn); Xaa at res.51 = (Phe, Leu, Asn, Ser, Ala or Val); Xaa at res.52 = (lie, Met, Asn, Ala or Val); 
Xaa at res.53 = (Asn, Lys, Ala or Glu); Xaa at res.54 = (Pro or Ser); Xaa at res.55 = (Glu, Asp, Asn, or Gly); Xaa at 
res.56 = (Thr, Ala, Val, Lys, Asp, Tyr, Ser or Ala); Xaa at res.57 = (Val, Ala or lie); Xaa at res.58 = (Pro or Asp); Xaa at 
res.59 = (Lys or Leu); Xaa at res.60 = (Pro or Ala); Xaa at res.63 = (Ala or Val); Xaa at res.65 = (Thr or Ala); Xaa at 
15 res.66 = (Gin, Lys, Arg or Glu); Xaa at res.67 = (Leu, Met or Val); Xaa at res.68 = (Asn, Ser or Asp); Xaa at res.69 = 
(Ala, Pro or Ser); Xaa at res.70 = (lie, Thr or Val); Xaa at res.71 = (Ser or Ala); Xaa at res.72 = (Val or Met); Xaa at 
res.74 = (Tyr or Phe); Xaa at res.75 = (Phe, Tyr or Leu); Xaa at res.76 = (Asp or Asn); Xaa at res.77 = (Asp, Glu, Asn 
or Ser); Xaa at res.78 = (Ser, Gin, Asn or Tyr); Xaa at res.79 = (Ser, Asn, Asp or Glu); Xaa at res.80 = (Asn, Thr or 
Lys); Xaa at res.82 = (lie or Val); Xaa at res.84 = (Lys or Arg); Xaa at res.85 = (Lys, Asn, Gin or His); Xaa at res.86 = 
20 (Tyr or His); Xaa at res.87 = (Arg, Gin or Glu); Xaa at res.88 = (Asn, Glu or Asp); Xaa at res.90 = (Val, Thr or Ala); Xaa 
at res. 92 = (Arg, Lys, Val, Asp or Glu); Xaa at res.93 = (Ala, Gly or Glu); and Xaa at res.97 = (His or Arg); and Generic 
Seq. 4: 



Generic Sequence 4 



Cys 


Xaa 


Xaa 


Xaa 


Xaa 


Leu 


Tyr Val 


Xaa 


Phe 


1 








5 










10 


Xaa 


Xaa 


Xaa 


Gly 


Tip 


Xaa 


Xaa 


Trp 


Xaa 












15 












Xaa 


Ala 


Pro 


Xaa 


Gly 


Xaa 


Xaa 


Ala 






20 










25 










Xaa 


Tyr 


Cys 


Xaa 


Gly 


xaa 


Cys 


Xaa 










30 










35 







40 



45 



50 



55 
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Xaa Pro Xaa Xaa Xaa Xaa Xaa 

40 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 
45 50 

Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa 

55 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 
60 65 

Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
70 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 
75 80 

Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
85 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 
90 95 

Xaa Cys Gly Cys Xaa 
100 



wherein each Xaa is independently selected from a group of one or more specified amino acids as defined by the 
30 following: "Res." means "residue” and Xaa at res.2 = (Lys or Arg); Xaa at res.3 = (Lys or Arg); Xaa at res.4 = (His or 
Arg); Xaa at res.5 = (Glu, Ser, His, Gly, Arg or Pro); Xaa at res.9 = (Ser, Asp or Glu); Xaa at res. 1 1 = (Arg, Gin, Ser or 
Lys); Xaa at res.12 = (Asp or Glu); Xaa at res.13 = (Leu or Val); Xaa at res.16 = (Gin, Leu, Asp, His or Asn); Xaa at 
res. 17 = (Asp, Arg, or Asn); Xaa at res.19 = (lie or Val); Xaa at res.20 = (lie or Val); Xaa at res.23 = (Glu, Gin, Leu, 
Lys, Pro or Arg); Xaa at res.25 = (Tyr or Phe); Xaa at res.26 = (Ala, Ser, Asp, Met, His, Leu, or Gin); Xaa at res.28 = 
35 (Tyr, Asn or Phe); Xaa at res.31 = (Glu, His, Tyr, Asp or Gin); Xaa at res.33 = Glu, Lys, Asp or Gin); Xaa at res.35 = 

(Ala, Ser or Pro); Xaa at res.36 = (Phe, Leu or Tyr); Xaa at res.38 = (Leu or Val); Xaa at res.39 = (Asn, Asp, Ala or 
Thr); Xaa at res.40 = (Ser, Asp, Glu, Leu or Ala); Xaa at res.41 = (Tyr, Cys, His, Ser or lie) ; Xaa at res.42 = (Met, Phe, 
Gly or Leu); Xaa at res.44 = (Ala, Ser or Gly); Xaa at res.45 = (Thr, Leu or Ser); Xaa at res.49 = (lie or Val); Xaa at res. 
50 = (Val or Leu); Xaa at res.51 = (Gin or Arg); Xaa at res.52 = (Thr, Ala or Ser); Xaa at res.54 = (Val or Met); Xaa at 
40 res.55 = (His or Asn); Xaa at res.56 = (Phe, Leu, Asn, Ser, Ala or Val); Xaa at res.57 = (lie, Met, Asn, Ala or Val); Xaa 

at res.58 = (Asn, Lys, Ala or Glu); Xaa at res.59 = (Pro or Ser); Xaa at res.60 = (Glu, Asp, or Gly); Xaa at res.61 = (Thr, 

Ala, Val, Lys, Asp, Tyr, Ser or Ala); Xaa at res.62 = (Val, Ala or lie); Xaa at res.63 = (Pro or Asp); Xaa at res.64 = (Lys 
or Leu); Xaa at res.65 = (Pro or Ala); Xaa at res.68 = (Ala or Val); Xaa at res.70 = (Thr or Ala); Xaa at res.71 = (Gin, 
Lys, Arg or Glu); Xaa at res. 72 = (Leu, Met or Val); Xaa at res.73 = (Asn, Ser or Asp); Xaa at res.74 = (Ala, Pro or 
45 Ser); Xaa at res.75 = (lie, Thr or Val); Xaa at res.76 = (Ser or Ala); Xaa at res.77 = (Val or Met); Xaa at res.79 = (Tyr 
or Phe); Xaa at-res.80 = (Phe, Tyr or Leu); Xaa at res.81 = (Asp or Asn); Xaa at res.82 = (Asp, Glu, Asn or Ser); Xaa 
at res. 83 = (Ser, Gin, Asn or Tyr); Xaa at res.84 = (Ser, Asn, Asp or Glu); Xaa at res.85 = (Asn, Thr or Lys); Xaa at res. 
87 = (lie or Val); Xaa at res.89 = (Lys or Arg); Xaa at res.90 = (Lys, Asn, Gin or His); Xaa at res.91 = (Tyr or His); Xaa 
at res.92 = (Arg, Gin or Glu); Xaa at res.93 = (Asn, Glu or Asp); Xaa at res.95 = (Val, Thr or Ala); Xaa at res.97 = (Arg, 
50 Lys, Val, Asp or Glu); Xaa at res.98 = (Ala, Gly or Glu); and Xaa at res.102 = (His or Arg). 

[0030] Particularly useful sequences for use as morphogens include the C-terminal domains, e.g., the C-terminal 
96-102 amino acid residues of Vgl, Vgr-1 , DPP, OF-1 , OP-2, CBMP-2A, CBMP-2B and GDF-1 (see Table II, infra, and 
Seq. ID Nos. 5-1 4) which include at least the conserved six or seven cysteine skeleton. In addition, biosynthetic con- 
structs designed from the generic sequences, such as COP-1, 3-5, 7, 16 (see Table III, infra) also are useful. Other 
55 sequences include the C-terminal CBMP3 and the inhibins/activin proteins (see, for example, U.S. Pat. Nos. 4,968,590 
and 5,011,691). Accordingly, other useful sequences are those sharing at least 70% amino acid sequence homology, 
and preferably 80% homology with any of the sequences above. These are anticipated to include allelic and species 
variants and mutants, and biosynthetic muteins, as well as novel members of this morphogenic family of proteins. 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



. 50 



55 



Particularly envisioned in the family of related proteins are those proteins exhibiting morphogenic activity and wherein 
the amino acid changes from the preferred sequences include conservative changes, e.g., those as defined by Dayoff 
et at., Atlas of Protein Sequence and Structure; vol. 5, Suppl. 3, pp. 345-362, (M.O. Dayoff, ed., Nat'l BioMed. Research 
Fdn., Washington, D.C. 1979). 

[0031 ] The currently most preferred protein sequences useful as morphogens include those having greater than 60% 
identity, preferably greater than 65% identity, with the amino acid sequence defining the conserved six cysteine skeleton 
of hOPI (e.g., residues 43-139 of Seq. ID No. 5). These most preferred sequences include both allelic and species 
variants of the OP1 and OP2 proteins. 

[0032] Thus, described herein are proteins comprising any of the polypeptide chains described above, whether iso- 
lated from naturally-occurring sources, or produced by recombinant DNA techniques, and includes allelic and species 
variants of these proteins, naturally-occurring or biosynthetic mutants thereof, as well as various truncated and fusion 
constructs. Deletion or addition mutants also are envisioned to be active (see infra), including those which may alter 
the conserved C-terminal cysteine skeleton, provided that the alteration does not functionally disrupt the relationship 
of these cysteines in the folded structure. Accordingly, such active forms are considered the equivalent of the specifically 
described constructs disclosed herein. The proteins may include forms having varying glycosylation patterns, varying 
N-termini, a family of related proteins having regions of amino acid sequence homology, and active truncated or mutated 
forms of native or biosynthetic proteins, produced by expression of recombinant DNA in host cells. 

[0033] The morphogenic proteins can be expressed from intact or truncated cDNA or from synthetic DNAs in pro- 
caryotic or eucaryotic host cells, and purified, cleaved, refolded, and dimerized to form morphogenicaliy active com- 
positions. Currently preferred host cells include E. coli or mammalian cells, such as CHO, COS or BSC cells. 

[0034] Thus, in view of this disclosure, skilled genetic engineers can isolate genes from cDNA or genomic libraries 
of various different species which encode appropriate amino acid sequences, or construct DNAs from oligonucleotides, 
and then can express them in various types of host cells, including both procaryotes and eucaryotes, to produce large 
quantities of active proteins capable of inducing tissue-specific cell differentiation and tissue morphogenesis in a variety 
of mammals including humans. 

[0035] A further utility described herein comprises inducing tissue-specific morphogenesis using the morphogenic 
proteins and pharmaceutical and therapeutic agents comprising the morphogens. Also defined (herein is) the use of 
these morphogens in the manufacture of pharmaceuticals for various medical procedures, including procedures for 
inducing tissue growth, procedures for inducing progenitor cell proliferation, procedures to inhibit neoplasm growth 
and procedures to promote phenotypic cell expression of differentiated cells. 

Brief Description of the Drawings 

[0036] The foregoing and other objects and features of this invention, as well as the invention itself, may be more 
fully understood from the following description, when read together with the accompanying drawings, in which: 

FIGURE 1 is a photomicrograph of a Northern Biot identifying Vgr-1 specific transcripts in various adult murine 
tissues; 

FIGURE 2 is a photomicrograph of a Northern Blot identifying mOP-1 -specific mRNA expression in various murine 
tissues prepared from 2 week old mice (panel A) and 5 week old mice (Panel B); 

FIGURE 3 is a photomicrograph of Northern Blots identifying mRNA expression of EF-Tu (A, control), mOP-1 (B, 
D), and Vgr-1 (C) in (1) 1 7-day embryos and (2) 3-day post natal mice; 

FIGURE 4A and 4B are photomicrographs showing the presence of OP-1 (by immunofluorescence staining) in 
the cerebral cortex (A) and spinal cord (B); 

FIGURE 5A and 5B are photomicrographs illustrating the ability of morphogen (OP-1) to induce undifferentiated 
NG108 calls (5A) to undergo differentiation of neural morphology (5B). 

FIGURE 6A-6D are photomicrographs showing the effect of morphogen (OP-1 ) on human embryo carcinoma cell 
redifferentiation; 

FIGURE 7 is a photomicrograph showing the effects of phosphate buffered saline (PBS, animal 1) or morphogen 
(OP-1, animal 2) on partially hepatectomized rats; 

FIGURE 8A - 8C are photomicrographs showing the effect of no treatment (8A), carrier matrix treatment (8B) and 
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morphogen treatment (OP-1, 8C) on dentin regeneration. 

Detailed Description 

5 [0037] Purification protocols first were developed which enabled isolation of the osteogenic (bone inductive) protein 

present in crude protein extracts from mammalian bone. (See WO-A-89 09787, and U.S. 4,968,590.) The development 
of the procedure, coupled with the availability of fresh calf bone, enabled isolation of substantially pure bovine osteo- 
genic protein (BOP). BOP was characterized significantly; its ability to induce cartilage and ultimately endochondral 
bone growth in cat, rabbit, and rat were demonstrated and studied; it was shown to be able to induce the full develop- 
io mental cascade of bone formation previously ascribed to unknown protein or proteins in heterogeneous bone extracts. 
This dose dependent and highly specific activity was present whether or not the protein was glycosylated (see U.S. 
Patent No. 4, 968, 958, filed 4/8/88 and Sampath et aL, (1 990) J. Biol. Chem. 265 : pp. 13198-13205). Sequence data 
obtained from the bovine materials suggested probe designs which were used to isolate genes encoding osteogenic 
proteins from different species. Human and murine osteogenic protein counterparts have now been identified and 
15 characterized (see, for example, U.S. Pat. No. 5,011 ,691 , Ozkaynak, et aL, (1 990) EMBO J 9 :2085-2093, and Ozkaynak 
et al., (1991) Biochem. Biochys. Res. Commn. 179 :116-123, the disciosures of which are herein incorporated by ref- 
erence.) 

[0038] Sequence data from the bovine materials also suggested substantial homology with a number of proteins 
known in the art which were not known to play a role in bone formation. Bone formation assays performed with these 
20 proteins showed that, when these proteins were implanted in a mammal in association with a suitable matrix, cartilage 

and endochondral bone formation was induced (see, for example, U.S. Patent No. 5,011 ,691.) One of these proteins 
is DPP, a Drosophila protein known to play a role in dorsal-ventral specification and required for the correct morpho- 
genesis of the imaginal discs. Two other proteins are related sequences identified in Xenopus and mouse (Vgl and 
Vgr-1 , respectively), thought to play a role in the control of growth and differentiation during embryogenesis. While 
25 DPP and Vgr-1 (or Vgr-1 -like) transcripts have been identified in a variety of tissues (embryonic, neonatal and adult, 
Lyons et al., (1 989) PNAS 86 :4554-4 558, and see infra), Vgl transcripts, which are maternally inherited and spacially 
restricted to the vegetal endoderm, decline dramatically after gastrulation. 

[0039] From these homologies a generic consensus sequence was derived which encompasses the active sequence 
required for inducing bone morphogenesis in a mammal when implanted in association with a matrix. The generic 
30 sequence has at least a conserved six cysteine skeleton (Generic Sequence 1 , Seq. ID No. 1) or, optionally, a 7-cysteine 

skeleton (Generic Sequence 2, Seq. ID No. 2), which includes the conserved six cysteine skeleton defined by Generic 
Sequence 1 , and an additional cysteine at residue 36, accomodating the additional cysteine residue identified in the 
OP2 proteins. Each "Xaa" in the generic sequences indicates that any one of the 20 naturally-occurring L-isomer, <*- 
amino acids or a derivative thereof may be used at that position. Longer generic sequences which also are useful 
35 further comprise the following sequence at their N-termini: 



Cys Xaa Xaa Xaa Xaa (Seq. ID No. 15) 

40 • 1 5 

[0040] Biosynthetic constructs designed from this generic consensus sequence also have been shown to induce 
cartilage and/or endochondral bone formation (e.g., COP-1, COP-3, COP-4, COP-5, COP-7 and COP-16, described 
45 jn U.S. Patent No. 5,011,691 and presented below in Table III.) Table II, set forth below, compares the amino acid 
sequences of the active regions of native proteins that have been identified as morphogens, including human OP-1 
(hOP-1 , Seq. ID Nos. 5 and 16-17), mouse OP-1 (mOP-1, Seq. ID Nos. 6 and 18-19), human and mouse OP-2 (Seq. 
ID Nos. 7, 8, and 20-22), CBMP2A (Seq. ID No. 9), CBMP2B (Seq. ID No. 10), DPP (from Drosophila, Seq. ID No. 
11), Vgl, (from Xenopus, Seq. ID No. 12), Vgr-1 (from mouse, Seq. ID No. 13), and GDF-1 (Seq. ID No. 14.) In the 
50 table, three dots indicates that the amino acid in that position is the same as the amino acid in hOP-1 . Three dashes 
indicates that no amino acid is present in that position, and are included for purposes of illustrating homologies. For 
example, amino acid residue 60 of CBMP-2A and CBMP-2B is “missing". Of course, both these amino acid sequences 
in this region comprise Asn-Ser (residues 58, 59), with CBMP-2A then comprising Lys and He, whereas CBMP-2B 
comprises Ser and lie. 

55 
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TABLE II 



5 


hOP-l 


Cys 


Lys 


Lys 


His 


Glu 


Leu 


iyr 


Val 




mOP~l 


♦ • • 


• • • 


• • • 


• • • 


• ft « 




ft ' ft ft 


ft ft ft 




hOP-2 




Arg 


Arg 


♦ ft ft 


• ft ft 








10 


mOP-2 




Arg 


Arg 


• ft ft 


• ft • 










DPP 




Arg 


Arg 


. . . 


Ser 










Vgl 




... 


Lys 


Arg 


His 








15 


Vgr-1 




. . . 


. . . 


• ft ft 


Gly 








CBHP-2A 




. • • . 


Arg 


ft ft ft 


Pro 










CBHP-2B 




Arg 


Arg 


• ft • 


Ser 






ft ft ft 




GDF-1 


... 


Arg 


Ala 


Arg 


Arg 


* # • 


ft ft ft 


ft ft ft 


20 




1 








5 









25 


hOP- 1 


Ser 


Phe 


Arg 


Asp 


Leu 


Gly 


Trp 


Gin 


Asp 




mOP-1 


• ft ft 




• • • 


. . . 


• ft ft 


• • ft 


... 


... 


ft ft ft 




hOP-2 


ft ft ft 




Gin 


ft ft ft 


ft ft • 






Leu 


... 




mOP-2 


Ser 




ft ft ft 


ft ft i 


ft ft ft 






Leu 


ft ft ft 


30 


DPP 


Asp 




Ser 


* • % 


Val 






Asp 


ft • ft 




Vgl 


Glu 




Lys 


ft • ft 


Val 






• ft • 


Asn 




Vgr-1 


ft ft ft 




Gin 


• ♦ ft 


Val 






• ft ft 


• « • 


35 


CBMP-2A 


Asp 




Ser 


• • ft 


Val 






Asn 


• « « 




CBMP-2B 


Asp 




Ser 


« » • 


Val 






Asn 


• • # 


40 


GDF-1 




10 




Glu 


Val 




15 


His 


Arg 




hOP-l 


Trp 


lie 


He 


Ala 


Pro 


Glu 


Gly 


Tyr 


Ala 




nOP-1 




* • • 


. . . 


• • ft 


. . . 


• • ft 






• • • 


45 


hOP-2 




Val 


ft ft ft 


• ♦ ft 


. . . 


Gin 






Ser 




mOP-2 




Val 


• • ft 


♦ • ft 


. . . 


Gin 






Ser 




DPP 




• • • 


Val • 


• • ft 


. . . 


Leu 






Asp 


50 


Vgl 




Val 


... 


« • ft 




Gin 






Met 




Vgr-1 




• • • 


• • • 


• * ft 


• • • 


Lys 




• . • 


, , , 
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CBMP-2A 


• • ♦ 




Val 


• . • 


... 


Pro 


... 


9 9 9 


His 




CBMP-2B 


• • • ■ 


. . . 


Val 


... 


... 


Pro 


... 


9 9 9 


Gin 


5 


GDF-1 




Val 


• . • 


... 


. . . 


Arg 


• • • 


Phe 


Leu 










20 










25 




10 










V 














hOP-1 


Ala 


Tyr 


Tyr 


Cys 


Glu 


Gly 


Glu 


Cys 


Ala 




mOP-1 


• • • 


t • • 


• * • 




• • • 


• • • 


• • • 


9 9 9 




15 


hOP-2 


• • • 


• • • 


• • • 




♦ • • 




• • • 


9 9 9 


Ser 


mOP-2 


... 


• ft 


• • • 




• 99 




♦ • ♦ 


-9 9 9 


• • • 




DPP 


• * • 


• • • 


• • 9 




His 




Lys 


9 9 9 


Pro 




Vgl 


... 


Asn 






Tyr 




9 9 9 


9 9 9 


Pro 


20 


Vgr-1 




Asn 


... 




Asp 




9 9 9 


9 9 9 


Ser 




CBMP-2A 


• • ♦ 


Phe 






His 




Glu 


9 9 9 


Pro 




CBMP-2B 


« « t 


Phe 


... 




His 




Asp 


9 9 9 


Pro 


25 


GDF-1 


* . . 


Asn 






Gin 




Gin 


• 99 


9 9 9 












30 










35 




hOP-1 


Phe 


Pro 


Leu 


Asn 


Ser 


Tyr 


Met 


Asn 


Ala 


30 


mOP-1 


• • • 


9 9 9 


. . . 


• * • 


... 


• • • 


• • • 


... 


• • • 




hOP-2 


• • • 


9 9 9 


• • • 


Asp 


. . . 


Cys 


♦ * • 


... 


• • • 




mOP-2 


... 


9 9 9 




Asp 


. . . 


Cys 


• 99 


... 


• • • 


35 


DPP 


... 


9 9 9 


• . . 


Ala 


Asp 


His 


Phe 


... 


Ser 




Vgl 


Tyr 


. . . 


• • • 


Thr 


Glu 


He 


Leu 


... 


Gly 




Vgr-1 






• • • 


• • • 


Ala 


His 


• t # 


... 


... 


40 


CBMP-2A 


. . . 




• 99 


Ala 


Asp 


His 


Leu 


... 


Ser 


CBMP-2B 


• . . 




9 9 9 


Ala 


Asp 


His 


Leu 


... 


Ser 




GDF-1 


Leu 


• • • 


Val 


Ala 


Leu 


Ser 


Gly 


Ser** 


... 












* 


AO 










45 
























hOP-1 


Thr 


Asn 


His 


Ala 


lie 


Val 


Gin 


Thr 


Leu 




mOP-1 


• 9 • 




... 


. . . 


. . . 


• • . 


• » • 


... 


• ft 


50 


hOP-2 


9 9 9 


. . • 


• ft 


9 9 9 


... 


Leu 


9 9 9 


Ser 


• * • 


mOP-2 


9 9 9 


... 


.• • 9 


9 9 9 


. . . 


Leu 


9 9 9 


Ser 


... 




DPP 


• 9 9 


... 


. . . 


•99 


Val 


... 


9 9 9 


• • • 
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Vgl 


Ser 


• # • 


. . . 


. . . 


. . . 


Leu 


t • • 


... 




5 


Vgr-l 


• • • 


• « « 


. . . 


. . . 


• • • 


... 


. # . 


... 






CBHP-2A 


♦ ♦ • 


• « • 


. . . 


• • . 


• • * 


• • • 


• • • 


... 






CBMP-2B 


... 


« • » 


. . . 


. . . 


. . . 


• • • 


• • • 








GDF-1 


Leu 




. . . 


. . . 


Val 


Leu 


Arg 


Ala 




10 




45 










50 










hOP-1 


Val 


His 


Phe 


lie 


Asn 


Pro 


Glu 


Thr 


Val 


15 


mOP-1 


... 


... 




. . . 


. . . 


• • • 


Asp 


... 


9 • e 




hOP-2 


• * ’ 


His 


Leu 


Met 


Lys 


• • • 


Asn 


Ala 


• • • 




mOP-2 




His 


Leu 


Met 


Lys 


• . ♦ 


Asp 


Val 


• • • 




DPP 


• • ♦ 


Asn 


Asn 


Asn 


• • • 


• • • 


Gly 


Lys 


« * • 


20 


Vgl 


• + • 


♦ • • 


Ser 




Glu 


m • • 


• f • 


Asp 


He 




Vgr-l 


• ♦ ♦ 


• • • 


Val 


Met 


. . . 


• • % 


• . • 


Tyr 


... 




CBKP-2A 




Asn 


Ser 


Val 




Ser 


— 


Lys 


lie 


25 


CBMP-2B 


... 


Asn 


Ser 


Val 


... 


Ser 


— 


Ser 


lie 




GDF-I 


Met 




Ala 


Ala 


Ala 


. . • 


Gly 


Ala 


Ala 








55 










60 






30 
























hOP-l 


Pro 


Lys 


Pro 


Cys 


Cys 


Ala 


Pro 


Thr 


Gin 




mOP-:i 




. . . 


♦ • • 




. . • 


• • • 




... 


• • • 


35 


hOP-2 


... 


. . . 


Ala 




. . . 


• • • 






Lys 




mOP-2 




... 


Ala 






• • • 






Lys 




DPP 


... 




Ala 




• • . 


Val 






• « « 


40 


Vgl 


... 


Leu 


... 




... 


Val 






Lys 




Vgr-l 


• • . 


• . • 


... 




... 


... 






Lys 




CBMP-2A 


. . . 


. . . 


Ala 




• • . 


Val 






Glu 




CBMP-2B 




... 


Ala 




. . • 


Val 






Glu 


45 


GDF-1 


Asp 


Leu 


... 




. . . 


Val 




Ala 


Arg 










65 










70 




50 
























hOP-1 


Leu 


Asn 


Ala 


lie 


Ser 


Val 


Leu 


iyr 


Phe 




mOP-1 


. • • 


. . . 


... 


. . . 


• t • 


... 


9 • • 


... 





55 



14 




EP 0 575 555 B1 



hOP-2 




Ser 


• ft ft 


Thr 


• . . 


• • • 




* • * 


Tyr 


mOF-2 


• • • 


Ser 


ft « • 


Thr 


. . . 


• • • 




» * * 


Tyr 


Vgl 


Met 


Ser 


Pro 


. . . 


• ft ft 


Met 




Phe 


Tyr 


Vgr-1 


Val 


• • • 


• • • 


. . . 


• ft • 


• • • 




... 


• ’ * • 


DPP 


... 


Asp 


Ser 


Val 


Ala 


Met 




• » # 


Leu 


CBMP-2A 


. . - 


Ser 


• • • 


• ft 


♦ ft ft 


Met 




• ft 


Leu 


CBMP-2B 


• . . 


Ser 


# • • 


• • ft 


ft ft ft 


Met 


. • * 


. . . 


Leu 


GDF-1 


• • • 


Ser 


Pro 


• • ft 

75 


ft m • 


# ♦ ♦ 


. . . 


Phe 


■ • • # 
80 


hOP-1 


Asp 


Asp 


Ser 


Ser 


Asn 


Val 


He 


Leu 


Lys 


mOP-1 


• • • 


• • ft 


• • • 


. . . 


• . . 




• • • ■ 




• • • 


hOP-2 


• ♦ • 


Ser 


• ft • 


Asn 


. . . 




• * • 




Arg 


mOP-2 


• • • 


Ser 


• ft • 


Asn 


. . . 




• • • 




Arg 


DPP 


Asn 


• • • 


Gin 


• • • 


Thr 




Val 




• • . 


Vgl 


• . i 


Asn 


Asn 


Asp 


. . . 




Val 




Arg 


Vgr-1 


• • ♦ 


. . . 


Asn 


• • • 


• • • 




• • • 




• • ♦ 


CBMP-2A 


. . . 


Glu 


Asn 


Glu 


Lys 




Val 




• • • 


CBMP-2B 


. . . 


Glu 


Tyr 


Asp 


Lys 




Val 




♦ * • 


GDF-1 




Asn 


• • • 


Asp 


85 




Val 




Arg 



hOP-1 


Lys 


Tyr 


Arg 


Asn 


Met 


Val 


Val 


Arg 


mOP-1 


• « • 




... 


. . . 


. . . 


... 




• • • 


hOP-2 


• • • 


His 


• . • 


. . . 




ft* 




Lys 


mOP-2 


• • • 


His 


. . . 


» • • 




• • • 




Lys 


DPP 


Asn 




Gin 


Glu 




Thr 




Val 


Vgl 


His 




Glu 


• • • 




Ala 




Asp 


Vgr-1 


. . . 




. . . 






# • • 




ft • • 


CBMP-2A 


Asn 




Gin 


Asp 




• « • 




Glu 


CBMP-2B 


Asn 




Gin 


Glu 




• • • 




Glu 


GDF-1 


Gin 

90 




Glu 


Asp 




• • * 
95 




Asp 
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hOP-l 


Ala 


Cys 


Gly 


Cys 


His 


mOP-1 


a • • 








a a a 


hOP-2 


. a • 








• . a 


mOP-2 










a a a 


DPP 


Gly 








Arg 


Vgl 


Glu 








Arg 


Vgr-1 


... 








. . . 


CBMP-2A 


Gly 








Arg 


CBMP-2B 


Gly 








Arg 


GDF-1 


Glu 


— 


100 


. . . 


Arg 



★★Between residues 43 and 44 of GDF-1 lies the amino acid 
sequence Gly-Gly-Pro-Pro. 

[0041] Table III, set forth below, compares the amino acid sequence data for six related biosynthetic constructs 
designated COPs 1, 3, 4, 5, 7, and 16. These sequences also are presented in U.S. Pat. No. 5,011,691. As with Table 
II, the dots mean that in that position there is an identical amino acid to that of COP-1 , and dashes mean that the COP- 
1 amino acid is missing at that position. 



TABLE III 



COP-1 Leu Tyr Val Asp 

COP-3 

COP-4 

COP-5 

COP- 7 

COP- 16 



Phe Gin Arg Asp Val 

• • • * * • ■ a a » i i a • • 

Ser 

• • a Ser — • • a • • a 

. . . Ser — 

■ • * Ser — — — a a • • a • 
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COP-1 


Gly 


Trp 


Asp 


Asp 


Trp 


He 


He 


Ala 


COP- 3 




• . . 


ft ft • 


• . . 




. . . 


Val 




COP-4 




ft ft ft 


ft • ft 


... 




• • • 


Val 


. . . 


COP-5 




ft ft • 


ft • ♦ 


... 


» ♦ • 


. . . 


Val 




COP- 7 




ft • ft 


Asn 


. . . 


• • • 




Val 




COP- 16 


10 




Asn 


... 




15 


Val 


... 


COP-1 


Pro 


Val 


Asp 


Phe 


Asp 


Ala 


Tyr 


Tyr 


COP-3 


. . . 


Pro 


Gly 


Tyr 


Gin 


. . . 


Phe 


. . . 


COP-4 


• ft ft 


Pro 


Gly 


Tyr 


Gin 


. . . 


Phe 




COP-5 


ft • ft 


Pro 


Gly 


Tyr 


Gin 


. . . 


Phe 




COP -7 


• • • 


Pro 


Gly 


Tyr 


His 


. . . 


Phe 


. . . 


COP- 16 


... 


Pro 


Gly 

20 


Tyr 


Gin 




Phe 


« • » 
25 


COP-1 


Cys 


Ser 


Gly 


Ala 


Cys 


Gin 


Phe 


Pro 



30 


COP-3 


ft ft ft 


ft • • 


• ft • 




... 




• « « 






COP -4 


ft ft • 


ft ft ft 


ft ft • 


— 


... 


. . . 


• # ft . 






COP- 5 


• • • 


His 


. . . 


Glu 


. . . 


Pro 








COP-7 


. . . 


His 


. . . 


Glu 


• • • 


Pro 


. . . 




35 


COP- 16 


• • • 


His 


... 


Glu 


... 


Pro 


. . • 














30 










40 


COP-1 


Ser 


Ala 


Asp 


His 


Phe 


Asn 


Ser 


Thr 




COP-3 


• * • 


. . . 




. . . 




. . . 




ft ft ft 




COP-4 


• • • 


. . . 








. . . 


... 


• ft • 




COP- 5 


Leu 


ft ft • 




... 


... 


... 


... 


ft • ft 


45 


COP- 7 


Leu 


ft ft ft 




. . . 


Leu 


... 


... 


• ft • 




COP- 16 


Leu 


ft ft ft 




... 


ft ft • 


• • • 


• ■ . 


ft ft ft 



35 40 
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COP-1 


Asn 


His 


Ala 


Val 


Val 


Gin 


Thr 


Leu 


5 


COP-3 


... 


... 


• . . 


. . • 


• . . 


• • 4 


• 4 4 


• • 4 




COP-4 


• . . 


• • • 


. . . 


• • • 


. . . 


. . • 


... 


... 




COP-5 


• . . 




• • • 


• • . 


• . . 


. • . 


4.4 


« * • 


10 


COP- 7 






• • * 


• • • 






• m • 


• • % 


15 


COP- 16 




• • • 


• • • 


45 






• • t 


• •• 




COP-1 


Asn 


Asn 


Met 


Asn 


Pro 


Gly 


Lys 


Val 




COP-3 


• . . 


. . . 


. . . 


. . . 


. . . 


• • • 


• • • 


• • • 


20 


COP- 4 


• • • 


• * * 


. . . 


• . . 


« » • 


• ♦ * 


» • • 


... 


COP-5 


• . . 


Ser 


Val 


. . . 


Ser 


Lys 


He 


— 




COP-7 


• • • 


Ser 


Val 


. . . 


Ser 


Lys 


lie 


— 


25 


COP- 16 




Ser 


Val 


... 


Ser 

55 


. Lys 


lie 


— 



30 


COP-1 


Pro 


Lys 


Pro 


Cys 


Cys 


Val 


Pro 


Thr 




COP-3 


• * • 


• • • 


• • • 


. . . 


... 


. . . 




4 4 4 




COP-4 


• • • 


• • • 


* • • 


• • • 




• 4 4 




... 




COP -5 


.... 


• • • 


Ala 


• 4 4 


... 


... 




4 4 4 


35 


COP-7 


4 • 4 


• . . 


Ala 


• 4 4 




4 4 4 




4 4 4 


40 


COP- 16 

/ 




60 


Ala 








65 






COP-1 


Glu 


Leu 


Ser 


Ala 


He 


Ser 


Met 


Leu 


45 


COP-3 








... 












COP- 4 


4 4 4 




... . 


4 4 4 


4.4 


4 4 4 


- 4 • 


4 4 4 




COP-5 




. 4 4 


. . . 


4 4 4 


. 4 4 


• 4 4 


... 


4 4 4 




COP- 7 


• • ♦ 


• • • 


• . 4 


• 4 4 


• 4 4 


• 4 4 


4 4 4 


• 44 


50 


COP- 16 


• • • 


• • • 


4 4 4 


4 4 4 


4 4 4 


4 4 4 




... 



55 
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COP-1 Tyr Leu Asp Glue Asn Ser Thr Val 

COP-3 ... . .. . Glu Lys . .. 

COP-4 ••• ••• ••• •«« » • ■ Glu Lys . . . 

COP— 5 ••• » • • ••• • * • ••• Glu Lys • • • 

COP— 7 ••• « • • ••• ••• Glu Lys . . . 

COP- 16 *•* ••• . • • ■ t • • Glu Lys • • • 

75 80 



20 



25 



30 



35 



40 



COP-1 Val Leu Lys Asn Tyr Gin Glu Met 
COP-3 ••• *«. • • » ... ... 
COP— 4 ••• ... • • * ••• • . . . .. 
COP- 5 ••• ... ... ... ... ... ... 
COP — 7 ••• **• • • • ••• ... ... ... 



COP- 16 






85 




... 




• * * 


90 


COP-1 


Thr 


Val 


Val 


Gly 


Cys 


Gly 


Cys 


Arg 


COP-3 


Val 




Glu 




. . . 








COP -4 


Val 


• . . 


Glu 




• • • 






• ♦ • 


COP-5 


Val 


. . . 


Glu 




• • • 






• * • 


COP- 7 


Val 


• • . 


Glu 




• • • 






« • « 


COP- 16 


Val 


. . , 


Glu 




... 









95 



[0042] As is apparent from the foregoing amino acid sequence comparisons, significant amino acid changes can be 
made within the generic sequences while retaining the morphogenic activity. For example, while the GDF-1 protein 
45 sequence depicted in Table II shares only about 50% amino acid identity with the hOPI sequence described therein, 
the GDF-1 sequence shares greater than 70% amino acid sequence homology with the hOPI sequence, where ho- 
mology is defined by allowed conservative amino acid changes within the sequence as defined by Dayoff, et al. t Atlas 
of Protein Sequence and Structure vol.5, supp.3, pp.345-362, (M.O. Dayoff, ed., Nat'l BioMed. Res. Fd’n, Washington 
D.C. 1979.) 

so [0043] It now has been discovered that the family of proteins described by these sequences also is capable of initiating 
and maintaining the tissue-specific developmental cascade in tissues other than bone and cartilage. When combined 
with naive progenitor cells as disclosed herein, these proteins, termed morphogens, are capable of inducing the pro- 
liferation and differentiation of the progenitor cells. In the presence of appropriate tissue-specific signals to direct the 
differentiation of these cells, and a morphogenically permissive environment, these morphogens are capable of repro- 
55 ducing the cascade of cellular and molecular events that occur during embryonic development to yield functional tissue. 
[0044] A key to these developments was the creation of a mammalian tissue model system, namely a model system 
for endochondral bone formation, and investigation of the cascade of events important for bone tissue morphogenesis. 
Work on this system has enabled discovery not only of bone inductive morphogens, but also of tissue inductive mor- 
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phogens and their activities. The methods used to develop the bone model system, now well-known in the art, along 
with the proteins of this invention, can be used to create model systems for other tissues, such as liver (see infra). 
[0045] Using the model system for endochondral bone formation, it also has been discovered that the local environ- 
ment in which the morphogenic material is placed is important for tissue morphogenesis. As used herein, “local envi- 
ronment" is understood to include the tissue structural matrix and the environment surrounding the tissue. For example, 
in addition to needing an appropriate anchoring substratum for their proliferation, the morphogen-stimulated cells need 
signals to direct the tissue-specificity of their differentiation. These signals vary for the different tissues and may include 
cell surface markers. In addition, vascularization of new tissue requires a local environment which supports vascular- 
ization. 

Using the bone model system as an example, it is known that, under standard assay conditions, implanting osteoin- 
ductive morphogens into loose mesenchyme in the absence of a tissue-specifying matrix generally does not result in 
endochondral bone formation unless very high concentrations of the protein are implanted. By contrast, implanting 
relatively low concentrations of the morphogen in association with a suitably modified bone-derived matrix results in 
the formation of fully functional endochondral bone (see, for example, Sampath et al. (1981) PNAS 78:7599-7 603 and 
U.S. Patent No. 4,975,526). In addition, a synthetic matrix comprised of a structural polymer such as tissue-specific 
collagen and tissue-specific cell attachment factors such as tissue-specific glycosylaminoglycans, will allow endochon- 
dral bone formation (see, for example, PCT publication US91/03603, published December 12, 1991 (WO 91/18558). 
Finally, if the morphogen and a suitable bone or cartilage-specific matrix (e.g., comprising Type I cartilage) are implanted 
together in loose mesenchyme, cartilage and endochondral bone formation will result, including the formation of bone 
marrow and a vascular system. However, if the same composition is provided to a nonvascular environment, such as 
to cultured cells in vitro or at an cartilage-specific locus, tissue development does not continue beyond cartilage for- 
mation (see infra). Similarly, a morphogenic composition containing a cartilage-specific matrix composed of Type 2 
collagen is expected to induce formation of non-cartilage tissue in vivo (e.g., hyaline). However, if the composition is 
provided to a vascular-supporting environment, such as loose mesenchyme, the composition is capable of inducing 
the differentiation of proliferating progenitor cells into chondrocytes and osteoblasts, resulting in bone formation. 
[0046] It also has been discovered that tissue morphogenesis requires a morphogenically permissive environment. 
Clearly, in fully-functioning healthy tissue that is not composed of a permanently renewing cell population, there must 
exist signals to prevent continued tissue growth. Thus, it is postulated that there exists a control mechanism, such as 
a feedback control mechanism, which regulates the control of cell growth and differentiation. In fact, it is known that 
both TGF-p, and MIS are capable of inhibiting cell growth when present at appropriate concentrations. In addition, 
using the bone model system it can be shown that osteogenic devices comprising a bone-derived carrier (matrix) that 
has been demineralized and guanidine-extracted to substantially remove the noncollagenous proteins does allow en- 
dochondral bone formation when implanted in association with an osteoinductive morphogen. If, however, the bone- 
derived carrier is not demineralized but rather is washed only in low salt, for example, induction of endochondral bone 
formation is inhibited, suggesting the presence of one or more inhibiting factors within the carrier. 

[0047] Another key to these developments was determination of the broad distribution of these morphogens in de- 
veloping and adult tissue. For example, DPP is expressed in both embryonic and developing Drosophila tissue. Vgl 
has been identified in Xenopus embryonic tissue. Vgr-1 transcripts have been identified in a variety of murine tissues, 
including embryonic and developing brain, lung, liver, kidney and calvaria (dermal bone) tissue. Recently, Vgr-1 tran- 
scripts also have been identified in adult murine lung, kidney, heart, and brain tissue, with especially high abundance 
in the lung (see infra). 

[0048] OP-1 and the CBMP2 proteins, both first identified as bone morphogens, have been identified in mouse and 
human placenta, hippocampus, calvaria and osteosarcoma tissue as determined by identification of OP-1 and 
CMBP2:Specific sequences in cDNA libraries constructed from these tissues (see Ozkaynak, et al., (1990) EMBO J 
9:2085-^093, and Ozkaynak et al., (1991) Biochem. Biophys. Res. Commn. 179 :116-123). Additionally, the OP-1 pro- 
tein is present in a variety of embryonic and developing tissues including kidney, liver, heart, adrenal tissue and brain 
as determined by Western blot analysis and immunolocalization (see infra). OP-1 -specific transcripts also have been 
identified in both embryonic and developing tissues, most abundantly in developing kidney, bladder and brain (see 
infra). OP-1 also has been identified as a mesoderm inducing factor present during embryogenesis (see infra). More- 
over, OP-1 has been shown to be associated with in satellite muscle cells and associated with pluripotential stem cells 
in bone marrow following damage to adult murine endochondral bone, indicating its morphogenic role in tissue repair 
and regeneration. In addition, the recently identified protein GDF-1 (see Table II) has been identified in neural tissue 
(Lee, (1991) PNAS 88 4250-4254). 
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Exemplification 

IDENTIFICATION AND ISOLATION OF MORPHOGENS 

5 [ 0049 ] Among the proteins useful in this invention are proteins originally identified as bone inductive proteins, such 

as the OP-1, OP-2 and the CBMP proteins, as well as amino acid sequence-related proteins, such as DPP (from 

Drosophila), Vgl (from Xenopus) and Vgr-1 (from mouse, see Table II and Sequence Listing). The members of this 
family, which include particular members of the TGF-p super family of structurally related proteins, share substantial 
amino acid sequence homology in their C-termirial regions. The OP-2 proteins have an extra cysteine residue in this 

io region (position 41 of Seq. ID Nos. 7 and 8), in addition to the conserved cysteine skeleton in common with the other 

proteins in this family. The proteins are inactive when reduced, but are active as oxidized, homodimeric species as 
well as when oxidized in combination with other morphogens. 

[ 0050 ] Accordingly, the morphogens of this invention can be described by either of the following two species of 
generic amino acid sequences: Generic Sequence 1 or Generic Sequence 2, (Seq. ID Nos. 1 and 2), where each Xaa 
is . indicates one of the 20 naturally-occurring L-isomer, a-amino acids or a derivative thereof. Particularly useful sequences 
that fall within this family of proteins include the 96-102 C-terminai residues of Vgl, Vgr-1 , DPP, OP-1, OP-2, CBMP- 
2A, CBMP-2B, and GDF-1, as well as their intact mature amino acid sequences. In addition, biosynthetic constructs 
designed from the generic sequences, such as COP-1 , COP-3-5, COP-7, and COP-1 6 also are useful (see, for example, 
U.S. Pat. No. 5,011,691.) 

20 [0051 ] Generic sequences showing preferred amino acids compiled from sequences identified to date and useful as 

morphogens (e.g., Tables II and III) are. described by Generic Sequence 3 (Seq. ID No. 3) and Generic Sequence 4 
(Seq. ID No. 4). Note that these generic sequences have a 7 or 8-cysteine skeleton where inter- or intramolecular 
disulfide bonds can form, and contain certain critical amino acids which influence the tertiary structure of the proteins. 
It is also contemplated that the differing N-termini of the naturally occurring proteins provide a tissue-specific or other, 
25 important modulating activity of these proteins. 

[ 0052 ] Given the foregoing amino acid and DNA sequence information, the level of skill in the art, and the disclosures 
of U.S. Patent Nos. 4,968,590 and 5,011,691, PCT application US 89/01469, published October 19, 1989 
(WO89/09788), and Ozkaynak, et al. , (1990) EMBO J 9:2085-2093, and Ozkaynak et al., (1991) Biochem. Biophys. 
Res. Commn. 179 :1 1 6-123, various DNAs can be constructed which encode at least the active region of a morphogen 
30 of this invention, and various analogs thereof (including allelic variants and those containing genetically engineered 
mutations), as well as fusion proteins, truncated forms of the mature proteins, deletion and insertion mutants, and 
similar constructs. Moreover, DNA hybridization probes can be constructed from fragments of the genes encoding any 
of these proteins, including seqeunces encoding the active regions or the pro regions of the proteins (see infra), or 
designed de novo from the generic sequence. These probes then can be used to screen different genomic and cDNA 
35 libraries to identify additional morphogenic proteins from different tissues. 

[ 0053 ] The DNAs can be produced by those skilled in the art using well known DNA manipulation techniques involving 
genomic and cDNA isolation, construction of synthetic DNA from synthesized oligonucleotides, and cassette muta- 
genesis techniques. 15-100mer oligonucleotides may be synthesized on a Biosearch DNA Model 8600 Synthesizer, 
and purified by polyacrylamide gel electrophoresis (PAGE) in Tris-Borate-EDTA buffer. The DNA then may be electro- 
40 eluted from the gel. Overlapping oligomers may be phosphorylated by T4 polynucleotide kinase and ligated into larger 
blocks which also may be purified by PAGE. 

[ 0054 ] The DNA from appropriately identified clones then can be isolated, subcloned (preferably into an expression 
vector), and sequenced. Plasmids containing sequences of interest then can be transfected into an appropriate host 
cell for expression of the morphogen and further characterization. The host may be a procaryotic or eucaryotic cell 
45 since the former's inability to glycosylate protein will not destroy the protein's morphogenic activity. Useful host cells 
include E. coli, Saccharomyces , the insect/baculovirus cell system, myeloma cells, and various other mammalian cells. 
The vectors additionally may encode various sequences to promote correct expression of the recombinant protein, 
including transcription promoter and termination sequences, enhancer sequences, preferred ribosome binding site 
sequences, preferred mRNA leader sequences, preferred signal sequences for protein secretion, and the like. 
so [ 0055 ] The DNA sequence encoding the gene of interest also may be manipulated to remove potentially inhibiting 
sequences or to minimize unwanted secondary and tertiary structure formation. The recombinant morphogen also may 
be expressed as a fusion protein. After being translated, the protein may be purified from the cells themselves or 
recovered from the culture medium. All biologically active protein forms comprise dimeric species joined by disulfide 
bonds or otherwise associated, produced by refolding and oxidizing one or more of the various recombinant polypeptide 
55 chains within an appropriate eucaryotic cell or in vitro after expression of individual subunits. A detailed description of 
morphogens expressed from recombinant DNA in E. coli and in numerous different mammalian cells is disclosed in 
PCT publication US90/05903, published May 2, 1991 (W091/05802). 

[0056] Alternatively, morphogenic polypeptide chains can be synthesized chemically using conventional peptide syn- 
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thesis techniques well known to those having ordinary skill in the art. For example, the proteins may be synthesized 
intact or in parts on a Biosearch solid phase peptide synthesizer, using standard operating procedures. Completed 
chains then are deprotected and purified by HPLC (high pressure liquid chromatography). If the protein is synthesized 
in parts, the parts may be peptide bonded using standard methodologies to form the intact protein. In general, the 
5 manner in which the morphogens are made can be conventional and does not form a part of this invention. 

MORPHOGEN DISTRIBUTION 

[0057] The generic function of the morphogens of this invention throughout the life of the organism can be evidenced 
io by their expression in a variety of disparate mammalian tissues. Determination of the tissue distribution of morphogens 
also may be used to identify different morphogens expressed in a given tissue, as well as to identify new, related 
morphogens. The proteins (or their mRNA transcripts) are readily identified in different tissues using standard meth- 
odologies and minor modifications thereof in tissues where expression may be low. For example, protein distribution 
may be determined using standard Western blot analysis or immunofluorescent techniques, and antibodies specific to 
is the morphogen or morphogens of interest. Similarly, the distribution of morphogen transcripts may be determined using 
standard Northern hybridization protocols and transcript-specific probes. 

[0058] Any probe capable of hybridizing specifically to a transcript, and distinguishing the transcript of interest from 
other, related transcripts may be used. Because the morphogens of this invention share such high sequence homology 
in their active, C-terminal domains, the tissue distribution of a specific morphogen transcript may best be determined 
20 using a probe specific for the pro region of the immature protein and/or the N-terminal region of the mature protein. 
Another useful sequence is the 3' non-coding region flanking and immediately following the stop codon. These portions 
of the sequence vary substantially among the morphogens of this invention, and accordingly, are specific for each 
protein. For example, a particularly useful Vgr-1 -specific probe sequence is the Pvull-Sacl fragment, a 265 bp fragment 
encoding both a portion of the untranslated pro region and the N-terminus of the mature sequence (see Lyons et al. 
25 (1989) PNAS 86:4554-4558 for a description of the cDNA sequence). Similarly, particularly useful mOP-1 -specific 

probe sequences are the BstXI -Bgll fragment, a 0.68 Kb sequence that covers approximately two-thirds of the mOP- 
1 pro region; a Stul-Stul fragment, a 0.2 Kb sequence immediately upstream of the 7-cysteine domain; and the 
Earl-Pstl fragment, an 0.3 Kb fragment containing a portion of the 3‘iintranslated sequence (See Seq. ID No. 18, 
where the pro region is defined essentially by residues 30-291.) Similar approaches may be used, for example, with 
30 hOPI (Seq. ID No. 16) or human or mouse OP2 (Seq. ID Nos. 20 and 22.) 

[0059] Using these morphogen-specific probes, which may be synthetically engineered or obtained from cloned 
sequences, morphogen transcripts can be identified in mammalian tissue, using standard methodologies well known 
to those having ordinary skill in the art. Briefly, total RNA is prepared from various adult murine tissues (e.g., liver, 
kidney, testis, heart, brain, thymus and stomach) by a standard methodology such as by the method of Chomczyaski 
35 et al. ((1 987) Anal. Biochem 1 62:1 56-1 59) and described below. Poly (A)+ RNA is prepared by using oligo (dT)-cellulose 
chromatography (e.g., Type 7, from Pharmacia LKB Biotechnology, Inc.). Poly (A)+ RNA (generally 15 pg) from each 
tissue is fractionated on a 1% agarose/formaldehyde gel and transferred onto a Nytran membrane (Schleicher & 
Schuell). Following the transfer, the membrane is baked at 80°C and the RNA is cross-linked under UV light (generally 
30 seconds at 1 mW/cm 2 ). Prior to hybridization, the appropriate probe (e.g., the Pvull-Sacl Vgr-1 fragment) is dena- 
40 tured by heating. The hybridization is carried out in a lucite cylinder rotating in a roller bottle apparatus at approximately 
1 rev/min for approximately 15 hours at 37°C using a hybridization mix of 40% formamide, 5 x Denhardts, 5 x SSPE, 
and 0.1% SDS. Following hybridization, the non-specific counts are washed off the filters in 0.1 x SSPE, 0.1% SDS at 
50°C. Northern blots performed using Vgr-1 probes specific to the variable N terminus of the mature sequence indicate 
that the Vgr-1 message is approximately 3.5 Kb. 

45 [0060] Figure 1 is a photomicrograph representing a Northern blot analysis probing a number of adult murine tissues 

with the Vgr-1 specific probes: liver, kidney, testis, heart, brain, thymus and stomach, represented in lanes 3-10, re- 
spectively. Lanes 1 and 12 are size standards and lanes 2 and 11 are blank. Among the tissues tested, Vgr-1 appears 
to be expressed most abundantly in adult lung, and to a lesser extent in adult kidney, heart and brain. These results 
confirm and expand on earlier studies identifying Vgr-1 and Vgr-1 -iike transcripts in several embryonic and adult murine 
50 tissue (Lyons et al. (1989) PNAS 86:4554-4558), as well as studies identifying OP-1 and CBMP2 in various human 
cDNA libraries (e.g., placenta, hippocampus, calvaria, and osteosarcoma, see Ozkaynak et al., (1990) EMBO 9: 
2085-2093). 

[0061] Using the same general probing methodology, mOP-1 transcripts also have been identified in a variety of 
murine tissues, including embryo and various developing tissues, as can be seen in Figures 2 and 3. Details of the 
55 probing methodology are disclosed in Ozkaynak, et al., (1991) Biochem. Biophys. Res. Commn. 179 :116-123, the 
disclosure of which is incorporated herein. The Northern blots represented in Figure 2 probed RNA prepared from 
developing brain, spleen, lung, kidney (and adrenal gland), heart, and liver in 13 day post natal mice (panel A) or 5 
week old mice (panel B). The OP-1 specific probe was a probe containing the 3' untranslated sequences described 
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supra (0.34 Kb Earl-Pst I fragment). As a control for RNA recovery, EF-Tu (translational elongation factor) mRNA 
expression also was measured (EF-Tu expression is assumed to be relatively uniform in most tissues). 

[0062] The arrowheads indicate the OP1 -specific messages observed in the various tissues. As can be seen in Fig. 
2, OP-1 expression levels vary significantly in the spleen, lung, kidney and adrenal tissues, while the EF-Tu mRNA 
5 levels are constant. Uniformly lower levels of EF-Tu mRNA levels were found in the heart, brain and liver. As can be 
seen from the photomicrograph, the highest levels of OP-1 mRNA appear to be in kidney and adrenal tissue, followed 
by the brain. By contrast, heart and liver did not give a detectable signal. Not shown are additional analyses performed 
on bladder tissue, which shows significant OP-1 mRNA expression, at levels close to those in kidney/adrenal tissue. 
The Northern blots also indicate that, like GDF-1 , OP-1 mRNA expression may be bicistonic in different tissues. Four 
io transcripts can be seen: 4 Kb, 2.4 Kb, 2.2 Kb, and 1 .8 Kb transcripts can be identified in the different tissues, and cross 
probing with OP-1 specific probes from the proregion and N-terminal sequences of the gene indicate that these tran- 
scripts are OP-1 specific. 

[0063] A side by side comparison of OP-1 and Vgr-1 in Figure 3 shows that the probes distinguish between the 
morphogens Vgr-1 and OP-1 transcripts in the different tissues, and also highlights the multiple transcription of OP-1 
15 in different tissues. Specifically, Fig. 3 compares the expression of OP-1 (Panels B and D), Vgr-1 (Panel C) and EF- 
Tu (Panel A) (control) mRNA in 17 day embryos (lane 1) and 3 day post-natal mice (lane 2). The same filter was used 
for sequential hybridizations with labeled DNA probes specific for OP-1 (Panels B and D), Vgr-1 (Panel C), and EF- 
Tu (Panel A). Panel A: the EF-Tu specific probe (control) was the 0.4 Kb Hindlll-Sacl fragment (part of the protein 
coding region), the Sacl site used belonged to the vector; Panel B: the OP-1 specific probe was the 0.68 Kb BstXI- 
20 Bgll fragment containing pro region sequences; Panel D; the OP-1 specific probe was the 0.34 Kb Earl-Pstl fragment 
containing the 3’ untranslated sequence; Panel C: the Vgr-1 specific probe was the 0.26 Kb Pvull-Sacl fragment used 
in the Vgr-1 blots described above. 

[0064] The 1 .8-2.5 Kb OP-1 mRNA appears approximately two times higher in three day post natal mice than in 1 7 
day embryos, perhaps reflecting phases in bone and/or kidney development. In addition, of the four messages found 
25 in brain, the 2.2 Kb transcript appears most abundant, whereas in lung and spleen the 1 .8 Kb message predominates. 
Finally, careful separation of the renal and adrenal tissue in five week old mice reveals that the 2.2 Kb transcripts were 
derived from renal tissue and the 4 Kb mRNA is more prominent in adrenal tissue (see Figure 2). 

[0065] Similarly, using the same general probing methodology, BMP3 and CBMP2B transcripts recently have been 
identified in abundance in lung tissue. 

30 [0066] Morphogen distribution in embryonic tissue can be determined using five or six-day old mouse embryos and 

standard immunofluorescence techniques in concert with morphogen-specific antisera. For example, rabbit anti-OP- 
1 antisera is readily obtained using any of a number of standard antibody protocols well known to those having ordinary 
skill in the art. The antibodies then are fluorescently labelled using standard procedures. A five or six-day old mouse 
embryo then is thin-sectioned and the various developing tissues probed with the labelled antibody, again following 
35 standard protocols. Using this technique, OP-1 protein has been detected in developing brain and heart. 

[0067] This method also may be used to identify morphogens in adult tissues undergoing repair. For example, a 
fracture site can be induced in a rat long bone such as the femur. The fracture then is allowed to heal for 2 or 3 days. 
The animal then is sacrificed and the fractured site sectioned and probed for the presence of the morphogen e.g., OP- 
1 , with fluorescently labelled rabbit anti-OP-1 antisera using standard immunolocalization methodology. This technique 
^o identifies OP-1 in muscle satellite cells, the progenitor cells for the development of muscle, cartilage and endochondral 
bone. In addition, OP-1 is detected with potential pluripotential stem cells in the bone marrow, indicating its morphogenic 
role in tissue repair and regeneration. 

[0068] OP-1 protein also has been identified in rat brain using standard immunofluorescence staining technique. 
Specifically, adult rat brain (2-3 months old) and spinal cord is frozen and sectioned. Anti-OP-1 , raised in rabbits and 
45 purified on an OP-1 affinity column prepared using standard methodologies, was added to the sections under standard 
conditions for specific binding. Goat anti-rabbit IgG, labelled with fluorescence, then was used to visualize OP-1 anti- 
body binding to tissue sections. 

[0069] As can be seen in FIG 4A and 4B, immunofluorescence staining demonstrates the presence of OP-1 in adult 
rat central nervous system (CNS.) Similar and extensive staining is seen in both the brain (4A) and spinal cord (4B). 
so OP-1 appears to be predominantly localized to the extracellular matrix of the grey matter, present in all areas except 
the neuronal cell bodies. In white matter, staining appears to be confined to astrocytes. A similar staining pattern also 
was seen in newborn rat (10 day old) brain sections. 

CELL DIFFERENTIATION 

55 

[0070] The ability of morphogens of this invention to induce cell differentiation can be determined by culturing early 
mesenchymal cells in the presence of the morphogen and then studying the histology of the cultured cells by staining 
with toluidine blue. For example, it is known that rat mesenchymal cells destined to become mandibular bone, when 



23 




EP 0 575 555 B1 



separated from the overlying epithelial cells at stage 11 and cultured in vitro under standard tissue culture conditions, 
will not continue to differentiate. However, if these same cells are left in contact with the overlying endoderm for an 
additional day, at which time they become stage 1 2 cells, they will continue to differentiate on their own in vitro to form 
chondrocytes. Further differentiation into obsteoblasts and, ultimately, mandibular bone, requires an appropriate local 
5 environment, e.g., a vascularized environment. 

[0071 ] It has now been discovered that stage 1 1 mesenchymal cells, cultured in vitro in the presence of a morphogen, 
e.g., OP-1 , continue to differentiate in vitro to form chondrocytes. These stage 1 1 cells also continue to differentiate in 
vitro if they are cultured with the cell products harvested from the overlying endodermal cells. Moreover, OP-1 can be 
identified in the medium conditioned by endodermal cells either by Western blot or immunofluorescence. This exper- 
io iment may be performed with other morphogens and with different mesenchymal cells to assess the cell differentiation 
capability of different morphogens, as well as their distribution in different developing tissues. 

[0072] As another example of morphogen-induced cell differentiation, the effect of OP-1 on the differentiation of 
neuronal cells has been tested in culture. Specifically, the effect of OP-1 on the NG108-15 neuroblastoma x glioma 
hybrid clonal cell line has been assessed. The cell line shows a fibroblastic-type morphology in culture! The cell line 
15 can be induced to differentiate chemically using 0.5 mM butyrate, 1% DMSO or 500 mM Forskolin, inducing the ex- 
pression of virtually all important neuronal properties of cultured primary neurons. However, chemical induction of these 
cells also induces cessation of cell division. 

[0073] In the present experiment NG108-15 cells were subcultured on poly-L-lysine coated 6 well plates. Each well 
contained 40-50,000 cells in 2.5 ml of chemically defined medium. On the third day 2.5 \i\ of OP-1 in 60% ethanol 
20 containing 0.025% trifluoroacetic was added to each well. OP-1 concentrations of 0, 1, 10, 40 and 100 ng/ml were 
tested. The media was changed daily with new aliquots of OP-1 . After four days with 40 and 100 ng OP-1/ml concen- 
trations, OP-1 induced differentiation of the NG108-15 cells. Figure 5 shows the morphological changes that occur. 
The OP-1 induces clumping and rounding of the cells and the production of neurite outgrowths (processes). Compare 
FIG 5A (naive NG108-1 5 cells) with FIG 5B, showing the effects of OPI-treated cells. Thus the OP-1 can induce the 
25 cells to differentiate into a neuronal cell morphology. Some of the outgrowths appear to join in a synaptic-type junction. 
This effect was not seen in cells incubated with TGF-B1 at concentrations of 1 to 100 ng/ml. 

[0074] The neuroprotective effects of OP-1 were demonstrated by comparison with chemical differentiation agents 
on the NG108-15 cells. 50,000 cells were plated on 6 well plates and treated with butyrate, DMSO, Forskolin or OP- 
1 for four days. Cell counts demonstrated that in the cultures containing the chemical agents the differentiation was 
30 accompanied by a cessation of cell division. In contrast, the cells induced to differentiate by OP-1 continued to divide, 
as determined by H^-thymidine uptake. The data suggest that OP-1 is capable of maintaining the stability of the cells 
in culture after differentiation. 

[0075] As yet another, related example, the ability of the morphogens of this invention to induce the "^differentiation" 
of transformed cells also has been assessed. Specifically, the effect of OP-1 on human EC cells (embryo carcinoma 
35 cells, NTERA-Z CL.D1 ) is disclosed herein. In the absence of an external stimulant these cells can be maintained as 
undifferentiated stem cells, and can be induced to grow in serum free media (SFM). In the absence of morphogen 
treatment the cells proliferate rampantly and are anchorage-independent. The effect of morphogen treatment is seen 
in Figs. 6A-D. Figs 6A and 6B show 4 days of growth in SFM in the presence of OP-1 (25ng/ml, 6A) or the absence 
of morphogen (6B). Figs. 6C and 6D are 5 days growth in the presence of lOng/ml OP-1 (6C) or no morphogen (6D). 
40 Figs. 6C and 6D are at 1 0x and 20x magnification compared to FIGs 6A and 5B. As can readily be seen, in the presence 

of OP-1 , EC cells grow as flattened cells, becoming anchorage dependent. In addition, growth rate is reduced approxr 
imately 10 fold. Finally, the cells are induced to differentiate. 

MAINTENANCE OF PHENOTYPE 

45 

[0076] The morphogens of this invention also may be used to maintain a cell's differentiated phenotype. This mor- 
phogenic capability is particularly useful for inducing the continued expression of phenotype in senescent or quiescent 
cells. 

[0077] The phenotypic maintenance capability of morphogens is readily assessed. A number of differentiated cells 
so become senescent or quiescent after multiple passages under standard tissue culture conditions in vitro. However, if 
these cells are cultivated in vitro in association with a morphogen of this invention, the cells are induced to maintain 
expression of their phenotype through multiple passages. For example, the alkaline phosphatase activity of cultured 
osteoblasts, like cultured osteoscarcoma cells and calvaria cells, is significantly reduced after multiple passages in 
vitro. However, if the cells are cultivated in the presence of a morphogen (e.g., OP-1 ), alkaline phosphatase activity is 
55 maintained over extended periods of time. Similarly, phenotypic expression of myocytes also is maintained in the 
presence of the morphogen. This experiment may be performed with other morphogens and different cells to assess 
the phenotypic maintenance capability of different morphogens on cells of differing origins. 

[0078] Phenotypic maintenance capability also may be assessed in vivo, using a rat model for osteoporosis* disclosed 
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in co-pending. USSN 752,857, filed August 30, 1991,, (corresponding to granted US Patent No. 5,674,844). As dis- 
closed therein, Long Evans rats are ovariectomized to produce an osteoporotic condition resulting from decreased 
estrogen production. Eight days after ovariectomy, rats are systemically provided with phosphate buffered saline (PBS) 
or OP-1 (21 pg or 20 pg) for 22 days. The rats then are sacrificed and serum alkaline phosphatase levels, serum 
5 calcium levels, and serum osteocalcin levels are determined, using standard methodologies. Three-fold higher levels 
of osteocalcin levels are found in rats provided with 1 or 20 pg of OP-1. Increased alkaline phosphatase levels also 
were seen. Histomorphometric analysis on the tibial diaphysical bone shows OP-1 can reduce bone mass lost due to 
the drop in estrogen levels. 

10 CELL STIMULATION 

[0079] The ability of the morphogens of this invention to stimulate the proliferation of progenitor cells also can be 
assayed readily in vitro. Useful naive stem cells include pluripotential stem cells, which may be isolated from bone 
marrow or umbilical cord blood using conventional methodologies, (see, for example, Faradji et al., (1988) Vox Sang. 
15 55 (3): 1 33-138 or Broxmeyer et al., (1989) PNAS 86 (10) :3828-3832), as well as naive stem cells obtained from blood. 

Alternatively, embryonic cells (e.g., from a cultured mesodermal cell line) may be useful. . 

[0080] Another method for obtaining progenitor cells and for determining the ability of morphogens to stimulate cell 
proliferation is to capture progenitor cells from an in vivo source. For example, a biocompatible matrix material able to 
allow the influx of migratory progenitor cells may be implanted at an in vivo site long enough to allow the influx of 
20 migratory progenitor cells. For example, a bone-derived, guanidine-extracted matrix, formulated as disclosed for ex- 

ample in Sampath et al. ((1983) PNAS 80 :6591-6595), or U.S. Patent No. 4,975,526, may be implanted into a rat at a 
subcutaneous site, essentially following the method of Sampath et al. (ibid). After three days the implant is removed, 
and the progenitor cells associated with the matrix dispersed and cultured. 

[0081] Progenitor cells, however obtained, then are incubated in vitro with a suspected morphogen under standard 
25 cell culture conditions well known to those having ordinary skill in the art. In the absence of external stimuli, the pro- 

genitor cells do not, or minimally proliferate on their own in culture. However, if the cells are cultured in the presence 
of a morphogen, such as OP-1 , they are stimulated to proliferate. Cell growth can be determined visually or spectro- 
photometrically using standard methods well known in the art. 

30 PROLIFERATION OF PROGENITOR CELL POPULATIONS 

[0082] Progenitor cells may be stimulated to proliferate in vivo or ex vivo. The cells may be stimulated in vivo by 
injecting or otherwise providing a sterile preparation containing the morphogen into the individual. For example, the 
hemopoietic pluripotential stem cell population of an individual may be stimulated to proliferate by injecting or otherwise 
35 providing an appropriate concentration of the morphogen to the individual's bone marrow. 

[0083] Progenitor cells may be stimulated ex vivo by contacting progenitor cells of the population to be enhanced 
with a morphogen under sterile conditions at a concentration and for a time sufficient to stimulate proliferation of the 
cells. In general, a period of from about 10 minutes to about 24 hours should be sufficient. The stimulated cells then 
are provided to the individual as, for example, by injecting the cells to an appropriate in vivo locus. Suitable biocom- 
*0 patible progenitor cells may be obtained by any of the methods known in the art or described herein. 

REGENERATION OF DAMAGED OR DISEASED TISSUE 

[0084] The morphogens of this invention may be used to repair diseased or damaged mammalian tissue. The tissue 
45 to be repaired is preferably assessed, and excess necrotic or interfering scar tissue removed as needed, by surgical, 
chemical, ablating or other methods known in the medical arts. 

[0085] The morphogen then may be provided directly to the tissue locus as part of a sterile, biocompatible compo- 
sition, either by surgical implantation or injection. Alternatively, a sterile, biocompatible composition containing morph- 
ogen-stimulated progenitor cells may be provided to the tissue locus. The existing tissue at the locus, whether diseased 
50 or damaged, provides the appropriate matrix to allow the proliferation and tissue-specific differentiation of progenitor 
cells. In addition, a damaged or diseased tissue locus, particularly one that has been further assaulted by surgical 
means, provides a morphogenically permissive environment. For some tissues, it is envisioned that systemic provision 
of the morphogen will be sufficient. 

[0086] In some circumstances, particularly where tissue damage is extensive, the tissue may not be capable of 
55 providing a sufficient matrix for cell influx and proliferation. In these instances, it may be necessary to provide the 
morphogen or morphogen-stimulated progenitor cells to the tissue locus in association with a suitable, biocompatible 
formulated matrix, prepared by any of the means described below. The matrix preferably is tissue-specific, jn vivo 
biodegradable, and comprises particles having dimensions within the range of 70*850p.m, most preferably 150-420jxm. 
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[0087] The morphogens of this invention also may be used to prevent or substantially inhibit scar tissue formation 
following an injury. If a morphogen is provided to a newly injured tissue locus, it can induce tissue morphogenesis at 
the locus, preventing the aggregation of migrating fibroblasts into non-differentiated connective tissue. The morphogen 
preferably is provided as a sterile pharmaceutical preparation injected into the tissue locus within five hours of the 
injury. Several non-limiting examples follow, illustrating the morphogens regenerate capabilities in different issues. The 
proteins of this invention previously have been shown to be capable of inducing cartilage and endochondral bone 
formation (See, for example U.S. Patent No. 5,011,691). 

[0088] . As an example, protein-induced morphogenesis of substantially injured liver tissue following a partial hepa- 
tectomy is disclosed. Variations on this general protocol may be used to test morphogen activity in other different 
tissues. The genera! method involves excising an essentially nonregenerating portion of a tissue and providing the 
morphogen, preferably as a soluble pharmaceutical preparation to the excised tissue locus, closing the wound and 
examining the site at a future date. Like bone, liver has a potential to regenerate upon injury during post-fetal life. 
[0089]. Morphogen, (e.g., purified recombinant human OP-1 , mature form), was solubilized (1 mg/ml) in 50% ethanol 
(or compatible solvent) containing 0.1% trifluoroacetic acid (or compatible acid). The injectable OP-1 solution was 
prepared by diluting one volume of OP- 1 /solvent-acid stock solution with 9 volumes of 0.2% rat.serum albumin in sterile 
PBS (phosphate-buffered saline). 

[0090] Growing rats or aged rats were anesthetized by using ketamine. Two of the liver lobes (left and right) were 
cut out (approximately 1/3 of the lobe) and the OP-1 was injected locally at multiple sites along the cut ends. The 
amount of OP-1 injected was 1 00 pg in 1 00 of PBS/RSA (phosphate buffered saline/rat serum albumin) injection buffer. 
Placebo samples are injection buffer without OP-1 . Five rats in each group were used. The wound was closed and the 
rats were allowed to eat normal food and drink tap water. 

[0091] After 12 days, the rats were sacrificed and liver regeneration was observed visually. The photomicrograph in 
Fig. 7 illustrates dramatically the regenerative effects of OP-1 on liver regeneration .'The OP-1-injected group showed 
complete liver tissue regeneration and no sign remained of any cut in the liver (animal 2). By contrast, in the control 
group into which only PBS was injected only minimal regeneration was evidenced (animal 1). In addition, the incision 
remains in this sample. 

[0092] As another example, the ability of the morphogens of this invention to induce dentinogenesis also was as- 
sessed. To date, the unpredictable response of dental pulp tissue to injury is a basic clinical problem in dentistry. 
Cynomolgus monkeys were chosen as primate models as monkeys are presumed to be more indicative of human 
dental biology than models based on lower non-primate mammals. 

[0093] Using standard dental surgical procedures, small areas (e.g., 2mm) of dental pulps were surgically exposed 
by removing the enamel and dentin immediately, above the pulp (by drilling) of sample teeth, performing a partial 
amputation of the corona! pulp tissue, inducing hemostasis, application of the pulp treatment, and sealing and filling 
the cavity by standard procedures. 

[0094] Pulp treatments used were: OP-1 dispersed in a carrier matrix; carrier matrix alone and no treatment. Twelve 
teeth per animal (four for each treatment) were prepared, and two animals were used. At four weeks, teeth were 
extracted and processed histologically for analysis of dentin formation, and/or ground to analyze dentin mineralization. 
FIG. 8 illustrates dramatically the effect of morphogen on osteodentin reparation. FIG. 8A is a photomicrograph of the 
control treatment (PBS) and shows little or no reparation. FIG. 8B is a photomicrograph of treatment with carrier alone, 
showing minimal reparation. By contrast, treatment with morphogen (FIG. 8C) shows significant reparation. The results 
of FIG. 8 indicate that OP-1-CM (OP-1 plus carrier matrix) reliably induced formation of reparative or osteodentin 
bridges on surgically exposed healthy dental pulps. By contrast, pulps treated with carrier matrix alone, or not treated 
failed to form reparative dentin. 

[0095] As another example, the morphogen-induced regenerative effects on central nervous system (CNS) repair 
may be assessed using a rat brain stab model. Briefly, male Long Evans rats are anesthesized and the head area 
prepared for surgery. The calvariae is exposed using standard surgical procedures and a hole drilled toward the center 
of each lobe using a 0.035K wire, just piercing the calvariae. 25pl solutions containing ejther morphogen (OP-1 , 25pg) 
or PBS then is provided to each of the holes by Hamilton syringe. Solutions are delivered to a depth approximately 3 
mm below the surface, into the underlying cortex, corpus callosum and hippocampus. The skin then is sutured and 
the animal allowed to recover. 

[0096] Three days post surgery, rats are sacrificed by decapitation and their brains processed for sectioning. Scar 
tissue formation is evaluated by immunofluoresence staining for glial fibrillary acidic protein, a marker protein for glial 
scarring, to qualitatively determine the degree of scar formation. Sections also are probed with anti-OP-1 antibodies 
to determine the presence of OP-1 . Reduced levels of glial fibrillary acidic protein are anticipated in the tissue sections 
of animals treated with morphogen, evidencing the ability of morphogen to inhibit glial scar formation, thereby stimu- 
lating nerve regeneration. 
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MORPHOGEN ACTIVITY MODULATION 

[0097] Antibodies to morphogens of this invention have been identified in healthy human sera. In addition, implanting 
devices comprising morphogens (e.g., OP-1 ) have been discovered to induce an increase in anti-morphogen antibodies 
5 (e.g., anti-OP-1 antibodies). It is anticipated that these antibodies comprise part of the body's regulation of morphogen 

activity in vivo. The presence of the antibodies, and fluctuations in their levels, which are readily monitored, can provide 
a useful method for monitoring tissue stasis and tissue viability (e.g., identification of a pathological state). For example, 
standard radioimmunoassays or ELISA may be used to detect and quantify endogeous anti-morphogen antibodies in 
sera. Antibodies or other binding proteins capable of detecting anti-morphogen antibodies may be obtained using 
io standard methodologies. 

MATRIX PREPARATION 

[0098] The morphogens of this invention may be implanted surgically, dispersed in a biocompatible, preferably in 
15 vivo biodegradable matrix appropriately modified to provide a structure in which the morphogen may be dispersed and 
which allows the influx, differentiation and proliferation of migrating progenitor cells. The matrix also should provide 
signals capable of directing the tissue specificity of the differentiating cells, as well as a morphogenicaily permissive 
environment, being essentially free of growth inhibiting signals. 

[0099] In the absence of these features the matrix does not appear to be suitable as part of a morphogenic compo- 
se sition. Recent studies on osteogenic devices (morphogens dispersed within a formulated matrix) using matrices for- 
mulated from polylactic acid and/or polyglycolic acid biopolymers, ceramics (a-tri-calcium-phosphate), or hydroxyap- 
atite show that these materials, by themselves, are unable to provide the appropriate environment for inducing de novo 
endochondral bone formation in rats by themselves. In addition, matrices formulated from commercially availabiehighly 
purified, reconstituted collagens or naturally-derived non-bone, species-specific collagen (e.g., from rat tail tendon) 
25 also are unsuccessful in inducing bone when implanted in association with an osteogenic protein. These matrices 
apparently lack specific structurally-related features which aid in directing the tissue specificity of the morphogen- 
stimulated, differentiating progenitor cells. 

[01 00] The formulated matrix may be shaped as desired in anticipation of surgery or may be shaped by the physician 
or technician during surgery. Thus, the material may be used in topical, subcutaneous, intraperitoneal, or intramuscular 
30 implants to repair tissue or to induce its growth de novo. The matrix preferably is biodegradable in vivo, being slowly 
absorbed by the body and replaced by new tissue growth, in the shape or very nearly in the shape of the implant. 
[0101] Details of how to make and how to use the matrices useful in this invention are disclosed below. 

TISSUE-DERIVED MATRICES 

35 

[ 0102 ] Suitable biocompatible, in vivo biodegradable acellular matrices may be prepared from naturally-occurring 
tissue. The tissue is treated with suitable agents to substantially extract the cellular, nonstructural components of the 
tissue. The agents also should be capable of extracting any growth inhibiting components associated with the tissue. 
The resulting material is a porous, acellular matrix, substantially depleted in nonstructurally-associated components. 
40 [0103] The matrix also may be further treated with agents that modify the matrix, increasing the number of pores 

and micropits on its surfaces. Those skilled in the art will know how to determine which agents are best suited to the 
extraction of nonstructural components for different tissues. For example, soft tissues such as liver and lung may be 
thin-sectioned and exposed to a nonpolar solvent such as, for example, 1 00% ethanol, to destroy the cellular structure 
of the tissue and extract nonstructural components. The material then is dried and pulverized to yield nonadherent 
45 porous particles. Structural tissues such as cartilage and dentin where collagen is the primary component may be 
demineralized and extracted with guanidine, essentially following the method of Sampath et al. (1983) PNAS 80: 
6591-6595. For example, pulverized and demineralized dentin is extracted with five volumes of 4M guanidine-HCI, 
50mM Tris-HCI, pH 7.0 for 16 hours at 4°C. The suspension then is filtered. The insoluble material that remains is 
collected and used to fabricate the matrix. The material is mostly collagenous in manner. It is devoid of morphogenic 
so activity. The matrix particles may further be treated with a collagen fibril-modifying agent that extracts potentially un- 
wanted components from the matrix, and alters the surface structure of the matrix material. Useful agents include 
acids, organic solvents or heated aqueous media. A detailed description of these matrix treatments are disclosed in 
U.S. Patent No. 4,975,526 and PCT publication US90/00912, published September 7, 1990 (W090/10018). 

[0104] The currently most preferred agent is a heated aqueous fibril-modifying medium such as water, to increase 
55 the matrix particle surface area and porosity. The currently most preferred aqueous medium is an acidic aqueous 
medium having a pH of less than about 4.5, e.g., within the range of about pH 2 - pH 4 which may help to “swell 0 the 
collagen before heating. 0.1% acetic acid, which has a pH of about 3, currently is most preferred. 0.1 M acetic acid 
also may be used. 
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[0105] Various amounts of delipidated, demineralized guanidine-extracted bone collagen are heated in the aqueous 
medium (1 g matrix/30ml aqueous medium) under constant stirring in a water jacketed glass flask, and maintained at 
a given temperature for a predetermined period of time. Preferred treatment times are about one hour, although ex- 
posure times of between about 0.5 to two hours appear acceptable. The temperature employed is held constant at a 
5 temperature within the range of about 37°C to 65°C. The currently preferred heat treatment temperature is within the 
range of about 45°C to 60°C. 

[0106] After the heat treatment, the matrix is filtered, washed, lyophilized and used for implant. Where an acidic 
aqueous, medium is used, the matrix also is preferably neutralized prior to washing and lyophilization. A currently 
preferred neutralization buffer is a 200mM sodium phosphate buffer, pH 7.0. To neutralize the matrix, the matrix pref- 
10 erably first is allowed to cool following thermal treatment, the acidic aqueous medium (e.g., 0.1% acetic acid) then is 
removed and replaced with the neutralization buffer and the matrix agitated for about 30 minutes. The neutralization 
buffer then may be removed and the matrix washed and lyophilized. 

[0107]. Other useful fibril-modifying treatments include acid treatments (e.g., trifluoroacetic acid and hydrogen fluo- 
ride) and solvent treatments such as dichloromethane, acetonitrile, isopropanol and chloroform, as weir as particular 
is acid/solvent combinations. 

[0108] After contact with the fibril-modifying agent, the treated matrix may be washed to remove any extracted com- 
ponents, following a form of the procedure set forth below: 

1 . Suspend matrix preparation in TBS (Tris-buffered saline) lg/200 ml and stir at 4°C for 2 hrs; or in 6 M urea, 50 

20 mM Tris-HCI, 500 mM NaCI, pH 7.0 (UTBS) or water and stir at room temperature (RT) for 30 minutes (sufficient 

time to neutralize the pH); 

2. Centrifuge and repeat wash step; and 

25 3. Centrifuge; discard supernatant; water wash residue; and then lyophilize. 

SYNTHETIC TISSUE-SPECIFIC MATRICES 

[0109] In addition to the naturally-derived tissue-specific matrices described above, useful tissue-specific matrices 
30 may be formulated synthetically if appropriately modified. These porous biocompatible, in vivo biodegradable synthetic 
matrices are disclosed in PCT publication US91/03603, published December 12, 1991 (W091/18558). Briefly, the 
matrix comprises a porous crosslinked structural polymer of biocompatible, biodegradable collagen and appropriate, 
tissue-specific glycosaminoglycans as tissue-specific cell attachment factors. Collagen derived from a number of sourc- 
es may be suitable for use in these synthetic matrices, including insoluble collagen, acid-soluble collagen, collagen 
35 soluble in neutral or basic aqueous solutions, as well as those collagens which are commercially available. 

[ 0110 ] Glycosaminoglycans (GAGs) or mucopolysaccharides are hexosamine-containing polysaccharides of animal 
origin that have a tissue specific distribution, and therefore may be used to help determine the tissue specificity of the 
morphogen-stimulated differentiating cells. Reaction with the GAGs also provides collagen with another valuable prop- 
erty, i.e., inability to provoke an immune reaction (foreign body reaction) from an animal host. 

40 [0111] Chemically, GAGs are made up of residues of hexoseamines glycosidically bound and alternating in a more- 

or-less regular manner with either hexouronic acid or hexose moieties (see, e.g., Dodgson et al. in Carbohydrate 
Metabolism and its Disorders (Dickens et al., eds.) Vol. 1, Academic Press (1968)). Useful GAGs include hyaluronic 
acid, heparin, heparin sulfate, chondroitin 6-sulfate, chondroitin 4-sulfate, dermatan sulfate, and keratin sulfate. Other 
GAGs are suitable for forming the matrix described herein, and those skilled in the art will either know or be able to 
45 ascertain other suitable GAGs using no more than routine experimentation. For a more detailed description of muco- 
polysaccharides, see Aspinall, Polysaccharides , Pergamon Press, Oxford (1970). For example, as disclosed in U.S. 
Application Serial No. 529,852 (corresponding to international publication number WO-A-91 18558, chondroitin-6-sul- 
fate can be used where endochondral bone formation is desired. Heparin sulfate, on the other hand, may be used to 
formulate synthetic matrices for use in lung tissue repair. 

so [0112] Collagen can be reacted with a GAG in aqueous acidic solutions, preferably in diluted acetic acid solutions. 
By adding the GAG dropwise into the aqueous collagen dispersion, coprecipitates of tangled collagen fibrils coated 
with GAG results. This tangled mass of fibers then can be homogenized to form a homogeneous dispersion of fine 
fibers and then filtered and dried. 

[0113] Insolubility of the collagen-GAG products can be raised to the desired degree by covalently cross-linking 
55 these materials, which also serves to raise the resistance to resorption of these materials. In general, any covalent 
cross-linking method suitable for cross-linking collagen also is suitable for cross-linking these composite materials, 
although crosslinking by a dehydrothermal process is preferred. 

[0114] When dry, the crosslinked particles are essentially spherical, with diameters of about 500 pm. Scanning elec- 
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tron miscroscopy shows pores of about 20 pm on the surface and 40 pm on the interior. The interior is made up of 
both fibrous and sheet-like structures, providing surfaces for cell attachment. The voids interconnect, providing access 
to the cells throughout the interior of the particle. The material appears to be roughly 99.5% void volume, making the 
material very efficient in terms of the potential cell mass that can be grown per gram of microcarrier. 

5 [01 1 5] The morphogens described herein can be combined and dispersed in an appropriately modified tissue-specific 

matrix using any of the methods described below: 

1 . Ethanol Precipitation 

Matrix is added to the morphogen dissolved in guanidine-HCl. Samples are vortexed and incubated at a low 
10 temperature. Samples are then further vortexed. Cold absolute ethanol is added to the mixture which is then stirred 

and incubated. After centrifugation (microfuge, high speed) the supernatant is discarded. The matrix is washed 
with cold concentrated ethanol in water and then lyophilized. 

2. Acetonitrile Trifluoroacetic Acid Lyophilization 

15 In this procedure, morphogen in an acetonitrile trifluroacetic acid (ACN/TFA solution is added to the carrier 

material. Samples are vigorously vortexed many times and then lyophilized. 

3. Buffered Saline Lyophilization 

Morphogen preparations in physiological saline may also be vortexed with the matrix and lyophilized to produce 
20 morphogenically active material. 

BIOASSAY 

[01 1 6] The following sets forth various procedures for evaluating the in vivo morphogenic utility of the morphogens 
25 and morphogenic compositions of this invention. The proteins and compositions may be injected or surgically implanted 
in a mammal, following any of a number of procedures well known in the art. For example, surgical implant bioassays 
may be performed essentially following the procedure of Sampath et al. (1983) PNAS 80:6591-6595. 

Histological Evaluation 

30 

[01 1 7] Histological sectioning and staining is preferred to determine the extent of morphogenesis in vivo, particularly 
in tissue repair procedures. Excised implants are fixed in Bouins Solution, embedded in paraffin, and cut into 6-8 ^im 
sections. Staining with toluidine blue or hemotoxylin/eosin demonstrates clearly the ultimate development of the new 
tissue. Twelve day implants are usually sufficient to determine whether the implants contain newly induced tissue. 

35 [0118] Successful implants exhibit a controlled progression through the stages of induced tissue development al- 

lowing one to identify and follow the tissue-specific events that occur. For example, in endochondral bone formation 
the stages include: 

(1 ) leukocytes on day one; (2) mesenchymal cell migration and proliferation on days two and three; 

40 (3) chondrocyte appearance on days five and six; 

(4) cartilage matrix formation on day seven; 

(5) cartilage calcification on day eight; (6) vascular invasion, appearance of osteoblasts, and formation of new 
bone on days nine and ten; (7) appearance of osteoblastic and bone remodeling and dissolution of the implanted 
matrix on days twelve to eighteen; and 

45 (8) hematopoietic bone marrow differentiation in the ossicle on day twenty-one. 

Biological Markers 

[0119] In addition to histological evaluation, biological markers may be used as a marker for tissue morphogenesis. 
so Useful markers include tissue-specific enzymes whose activities may be assayed (e.g., spectrophotometrically) after 
homogenization of the implant. These assays may be useful for quantitation and for obtaining an estimate of tissue 
formation quickly after the implants are removed from the animal. For example, alkaline phosphatase activity may be 
used as a marker for osteogenesis. 

[0120] Incorporation of systemically provided morphogens may be followed using tagged morphogens (e.g., radio- 
55 actively labelled) and determining their localization in new tissue, and/or by monitoring their disappearance from the 
circulatory system using a standard pulse-chase labeling protocol. The morphogen also may be provided with a tissue- 
specific molecular tag, whose uptake may be monitored and correlated with the concentration of morphogen provided. 
As an example, ovary removal in female rats results in reduced bone alkaline phosphatase activity, rendering the rats 
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predisposed to osteoporosis. If the female rats now are provided with a morphogen, e.g., OP-1, a reduction in the 
systemic concentration of calcium (CA 2+ ) is seen, which correlates with the presence of the provided morphogen and 
can be shown to correspond to increased alkaline phosphatase activity. 

[0121] The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the 
5 scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes 
which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. 

SEQUENCE LISTING 

io [ 0122 ] 

(1) GENERAL INFORMATION: 

(i) APPLICANT: COHEN, CHARLES M. 

15 

KUBERASAMPATH, THANGAVEL 
PANG, ROY H.L. 

OPPERMANN, HERMANN 
RUEGER, DAVID C. 

20 

(ii) TITLE OF INVENTION: PROTEIN-INDUCED MORPHOGENESIS 
(Hi) NUMBER OF SEQUENCES: 23 

25 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: TESTA, HURWITZ & THIBEAULT 

(B) STREET: 53 STATE STREET 

(C) CITY: BOSTON 

30 (D) STATE: MASSACHUSETTS 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02109 

(v) COMPUTER READABLE FORM: 

35 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1 .0, Version #1.25 

40 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 667,274 

(B) FILING DATE: 11 -MAR-1 991 

45 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 752,764 

(B) FILING DATE: 30-AUG-1 991 

50 

(2) INFORMATION FOR SEQ ID NO:1 : . 

(i) SEQUENCE CHARACTERISTICS: 

55 (A) LENGTH: 97 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Generic Sequence 1 

(D) OTHER INFORMATION: Each Xaa indicates one of the 20 naturally-occurring L-isomer, d-amino acids 
or a derivative thereof. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 : 

Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 
20 25 

Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
30 35 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
40 45 50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

55 60 

Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 
85 90 

Xaa Cys Xaa 
95 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Generic Sequence 2 

(D) OTHER INFORMATION: Each Xaa indicates one of the 20 naturally-occurring L-isomer, a-amino acids 
or a derivative thereof. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



SO 



55 



Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 
20 25 

Cys Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 
30 35 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
40 45 50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

55 60 

Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 
85 90 - 

Xaa Cys Xaa 
95 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Generic Sequence 3 

(D) OTHER INFORMATION: wherein each Xaa is independently selected from a group of 
specified amino acids as defined in the specification. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



one or more 



32 




EP 0 575 555 B1 



5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Leu Tyr Val Xaa Phe 
1 5 

Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

10 

Xaa Ala Pro Gly Xaa Xaa Xaa Ala 
15 20 

Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 
25 30 

Xaa Pro Xaa Xaa Xaa Xaa Xaa 

35 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 
40 45 

Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa 

50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 
55 60 

Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
65 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 
70 75 

Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
80 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 
85 90 

Xaa Cys Gly Cys Xaa 
95 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Generic Sequence 4 

(D) OTHER INFORMATION: wherein each Xaa is independently selected from a group of one or more 
specified amino acids as defined in the specification. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Cys Xaa Xaa Xaa Xaa Leu Tyr Val Xaa Phe 
15 10 

Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

15 

Xaa Ala Pro Xaa Gly Xaa Xaa Ala 
20 25 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 
30 35 

Xaa Pro Xaa Xaa Xaa Xaa Xaa 

40 

Asn Xaa Xaa Asn His Ala Xaa Xaa 
45 50 

Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa 

55 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 
60 65 

Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
70 



Xaa Xaa Xaa Leu Xaa Xaa Xaa 
75 80 

Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
85 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 
90 95 

Xaa Cys Gly Cys Xaa 
100 



(2) INFORMATION FOR SEQ ID NO:5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 39 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: hOP-1 (mature torm) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
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Thr 


Gly 


5 


l 

Asn 


Arg 


Ser 




10 








Glu 


Ala 


Leu 


10 




20 






Glu 


Asn 


Ser 








30 


15 


Ala 


Cys 


Lys 




Ser 


Phe 


Arg 


20 


Trp 


He 


lie 




55 








Ala 


Tyr 


Tyr 


25 




65 





30 


Phe 


Pro 


Leu 








75 




Thr 


Asn 


His 


35 


Val 


His 


Phe 




Pro 


Lys 


Pro 


40 


100 








Leu 


Asn 


Ala 






110 




45 


Asp 


Asp 


Ser 








120 




Lys 


Tyr 


Arg 


50 


Cys 


Gly 


Cys 



55 (2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 



Ser 


Lys 


Gin 


Arg 


Ser 


Gin 


Lys 


5 

Thr 


Pro 


Lys 


Asn 


Gin 


Arg 


Met 


15 

Ala 


Asn 


Val 


Ala 


Ser 


Ser 


Asp 


25 

Gin * 


Arg 


Gin 


Lys 


His 


Glu 


Leu 


35 

Tyr 


Val 


40 

Asp 


Leu 


Gly 


Trp 


Gin 


45 

Asp 


Ala 


50 

Pro 


Glu 


Gly 


Tyr 


Ala 


Cys 


Glu 


60 

Gly 


Glu 


Cys 


Ala 



70 



Asn 


Ser 


Tyr 


Met 


Asn 

80 


Ala 


Ala 

85 


lie 


Val 


Gin 


Thr 


Leu 

90 


lie 


Asn 

95 


Pro 


Glu 


Thr 


Val 


Cys 


Cys 


Ala 

105 


Pro 


Thr 


Gin 


lie 


Ser 


Val 


Leu 

115 


Tyr 


Phe 


Ser 


Asn 


Val 


lie 


Leu 

125 


Lys 


Asn 

130 

His 


Met 


Val 


Val 


Arg 


Ala 

135 



35 




EP 0 575 555 B1 



(A) LENGTH: 139 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME: mOP-1 (mature form) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 



10 

Ser Thr Gly 
1 

15 Asn Arg Ser 

10 

Glu Ala Leu 



25 



30 



35 



40 



45 



50 



Gly 


Lys 

5 


Gin 


Arg 


Ser 


Gin 


Lys 


Thr 


Pro 

15 


Lys 


Asn 


Gin 


Arg 


Met 


Ala 


Ser 

25 


Val 


Ala 
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Glu 


Asn 


Ser 








30 


5 


Ala 


Cys 


Lys 




Ser 


Phe 


Arg 


10 


Trp 


lie 


lie 




55 








Ala 


Tyr 


Tyr 


15 




65 






Phe 


Pro 


Leu 








75 


20 


Thr 


Asn 


His 




Val 


His 


Phe 


25 


Pro 


Lys 


Pro 




100 








Leu 


Asn 


Ala 


30 




110 






Asp 


Asp 


Ser 








120 


35 


Lys 


Tyr 


Arg 




Cys 


Gly 


Cys 



Ser 


Ser 


Asp 


Gin 


Arg 

35 


Gin 


Lys 

40 


His 


Glu 


Leu 


Tyr 


Val 

45 


Asp 


Leu 

50 


Gly 


Trp 


Gin 


Asp 


Ala 


Pro 


Glu 

60 


Gly 


Tyr 


Ala 


Cys 


Glu 


Gly 


Glu 

70 


Cys 


Ala 


Asn 


Ser 


Tyr 


Met 


Asn 

80 


Ala 


Ala 

85 


lie 


Val 


Gin 


Thr 


Leu 

90 


lie 


Asn 

95 


Pro 


Asp 


Thr 


Val 


Cys 


Cys 


Ala 

105 


Pro 


Thr 


Gin 


lie 


Ser 


Val 


Leu 

115 


Tyr 


Phe 


Ser 


Asn 


Val 


lie 


Leu 

125 


Lys 


Asn 

130 

His 


Met 


Val 


Val 


Arg 


Ala 

135 



40 



(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

45 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

so (ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME: hOP-2 (mature form) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

55 
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Ala 


Val 


Arg 


Pro 


Leu 


Arg 


Arg 


Arg 


Gin 




1 








5 










5 


Pro 


Lys 


Lys 


Ser 


Asn 


Glu 


Leu , 


Pro 


Gin 




10 










15 










Ala 


Asn 


Arg 


Leu 


Pro 


Gly 


lie 


Phe 


Asp 


10 




20 










25 








Asp 


Val 


His 


Gly 


Ser 


His 


Gly 


Arg 


Gin 








30 










35 






Val 


Cys 


Arg 


Arg 


His 


Glu 


Leu 


Tyr 


Val 


ID 








40 










45 




Ser 


Phe 


Gin 


Asp 


Leu 


Gly 


Trp 


Leu 


Asp 












50 










20 


Trp 


Val 


He 


Ala 


Pro 


Gin 


Gly 


Tyr 


Ser 




55 










60 










Ala 


Tyr 


Tyr 


Cys 


Glu 


Gly 


Glu 


Cys 


Ser 






65 










70 






25 






















Phe 


Pro 


Leu 


Asp 


Ser 


Cys 


Met 


Asn 


Ala 








75 










80 






Thr 


Asn 


His 


Ala 


lie 


Leu 


Gin 


Ser 


Leu 


30 








85 










90 




Val 


His 


Leu 


Met 


Lys 


Pro 


Asn 


Ala 


Val 












95 










35 


Pro 


Lys 


Ala 


Cys 


Cys 


Ala 


Pro 


Thr 


Lys 




100 










105 










Leu 


Ser 


Ala 


Thr 


Ser 


Val 


Leu 


Tyr 


Tyr 






110 










115 






40 


Asp 


Ser 


Ser 


Asn 


Asn 


Val 


lie 


Leu 


Arg 








120 










125 






Lys 


His 


Arg 


Asn 


Met 


Val 


Val 


Lys 


Ala 


45 








130 










135 




Cys 


Gly 


Cys 


His 












50 






















(2) INFORMATION FOR SEQ ID NO:8: 















(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 139 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: mOP-2 (mature form) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
5 





Ala 


Ala 


Arg 


Pro 


Leu 


Lys 


Arg 


Arg 


Gin 




1 








5 










10 


Pro 


Lys 


Lys 


Thr 


Asn 


Glu 


Leu 


Pro 


His 




10 










15 










Pro 


Asn 


Lys 


Leu 


Pro 


Gly 


lie 


Phe 


Asp 


15 




20 










25 








Asp 


Gly 


His 


Gly 


Ser 


Arg 


Gly 


Arg 


Glu 








30 










35 






Val 


Cys 


Arg 


Arg 


His 


Glu 


Leu 


tyr 


Val 


20 








40 








45 




Arg 


Phe 


Arg 


Asp 


Leu 


Gly 


Trp 


Leu 


Asp 












50 










25 


Trp 


Val 


He 


Ala 


Pro 


Gin 


Gly 


Tyr 


Ser 




55 










60 










Ala 


Tyr 


Tyr 


Cys 


Glu 


Gly 


Glu 


Cys 


Ala 






65 










70 






30 


Phe 


Pro 


Leu 


Asp 


Ser 


Cys 


Met 


Asn 


Ala 








75 










80 






Thr 


Asn 


His 


Ala 


lie 


Leu 


Gin 


Ser 


Leu 


35 








85 










90 




Val 


His 


Leu 


Met 


Lys 


Pro 


Asp 


Val 


Val 












95 










40 


Pro 


Lys 


Ala 


Cys 


Cys 


Ala 


Pro 


Thr 


Lys 


100 










105 









45 


Leu 


Ser 


Ala 


Thr 


Ser 


Val 


Leu 


Tyr 


Tyr 






110 










115 






50 


Asp 


Ser 


Ser 


Asn 


Asn 


Val 


lie 


Leu 


Arg 






120 










125 






Lys 


His 


Arg 


Asn 


Met 


Val 


Val 


Lys 


Ala 










130 










135 


55 


Cys 


Gly 


Cys 


His 













(2) INFORMATION FOR SEQ ID NO:9: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acids 

5 (C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: CBMP-2A(fx) 

io (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser 
15 1 5 10 

Asp Val Gly Trp Asn Asp Trp He Val Ala Pro 
15 20 

20 Pro Gly Tyr His Ala Phe Tyr Cys His Gly Glu 

25 30 

Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser 
35 40 

25 Thr Asn His Ala lie Val Gin Thr Leu Val Asn 

45 50 55 

Ser Val Asn Ser Lys lie Pro Lys Ala Cys Cys 
30 60 65 

Val Pro Thr Glu Leu Ser Ala lie Ser Met Leu 
70 75 

Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys 

35 

80 85 



40 



Asn Tyr Gin Asp Met Val Val Glu Gly Cys Gly 
90 95 

Cys Arg 
100 



50 



(2) INFORMATION FOR SEQ ID NO:10: 
(i) SEQUENCE CHARACTERISTICS: 

55 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: CBMP-2B(fx) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 



Cys Arg Arg His Ser 

,o 1 5 

Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 

10 15 

Asp Trp I.le Val Ala Pro Pro Gly Tyr Gin Ala 
' 5 20 25 

Phe Tyr Cys His Gly Asp Cys Pro Phe Pro Leu 
30 35 

20 Ala Asp His Leu Asn Ser Thr Asn His Ala lie 

40 45 

Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser 

50 55 60 

25 

He Pro Lys Ala Cys Cys Val Pro Thr Glu Leu 

65 70 

Ser Ala lie Ser Met Leu Tyr Leu Asp Glu Tyr 
30 75 80 

Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met 
85 90 

35 

Val Val Glu Gly Cys Gly Cys Arg 

40 95 100 



45 (2) INFORMATION FOR SEQ ID NO:11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 
so (B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

55 (A) NAME: DPP(ifx) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: 



41 




EP 0 575 555 B1 



5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser 
15 10 

Asp Val Gly Trp Asp Asp Trp lie Val Ala Pro 
15 20 

Leu Gly Tyr Asp Ala Tyr Tyr Cys His Gly Lys 
25 30 

Cys Pro Phe Pro Leu Ala Asp His Phe Asn Ser 
35 40 

Thr Asn His Ala Val Val Gin Thr Leu Val Asn 
45 50 55 

Asn Asn Asn Pro Gly Lys Val Pro Ly's Ala Cys 
60 -65 

Cys Val Pro Thr Gin Leu Asp Ser Val Ala Met 
70 75 

Leu Tyr Leu Asn Asp Gin Ser Thr Val Val Leu 
80 85 
Lys Asn Tyr Gin Glu Met Thr Val Val Gly Cys 
90 95 

Gly Cys Arg 
100 



(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Vgl(fx) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
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5 



10 



IS 



20 



25 



30 



Cys Lys Lys Arg His Leu Tyr Val Glu Phe Lys 
1 5 10 

Asp Val Gly Trp Gin Asn Trp Val lie Ala Pro 
15 20 

Gin Gly Tyr Met Ala Asn Tyr Cys Tyr Gly Glu 
25 30 

Cys Pro Tyr Pro Leu Thr Glu lie Leu Asn Gly 
35 40 

Ser Asn His Ala He Leu Gin Thr Leu Val His 
45 50 55 

Ser lie Glu Pro Glu Asp lie Pro Leu Pro Cys 

60 65 
Cys Val Pro Thr Lys Met Ser Pro lie Ser Met 
70 75 

Leu Phe Tyr Asp Asn Asn Asp Asn Val Val Leu 
80 85 

Arg His Tyr Glu Asn Met Ala Val Asp Glu Cys 
90 95 

Gly Cys Arg 
100 



35 



40 



45 



(2) INFORMATION FOR SEQ ID NO:13: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Vgr-I(fx) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 



50 



55 
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Cys Lys Lys His Glu Leu Tyr Val Ser Phe Gin 

5 1 5 10 

Asp Val Gly Trp Gin Asp Trp He lie Ala Pro 
15 20 

Xaa Gly Tyr Ala Ala Asn Tyr Cys Asp Gly Glu 
10 25 30 

Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala 
35 40 

,5 Thr Asn His Ala lie Val Gin Thr Leu Val His 

45 50 55 

Val Met Asn Pro Glu Tyr Val Pro Lys Pro Cys 

60 65 

£ ° Cys Ala Pro Thr Lys Val Asn Ala lie Ser Val 

70 75 

Leu Tyr Phe Asp Asp Asn Ser Asn Val lie Leu 
25 80 85 

Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys 
90 95 

Gly Cys His 
100 

INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: protein 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human 
(F) TISSUE TYPE: BRAIN 

(ix) FEATURE: 

(D) OTHER INFORMATION: /product= "GDF-1 (fx)" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 



55 



(2) 

35 



40 



45 



50 
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Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly 

1 5 10 

5 Trp His Arg Trp Val lie Ala Pro Arg Gly Phe Leu Ala Asn iyr 

15 20 25 

Cys Gin Gly Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly 

30 35 40 

10 

Gly Pro Pro Ala Leu Asn His Ala Val Leu Arg Ala Leu Met His 

45 50 55 

Ala Ala Ala Pro Gly Ala Ala Asp Leu Pro Cys Cys Val Pro Ala 

is 60 65 70 

Arg Leu Ser Pro He Ser Val Leu Phe Phe Asp Asn Ser Asp Asn 

75 80 85 

20 Val Val Leu Arg Gin Tyr Glu Asp Met Val Val Asp Glu Cys Gly 

90 95 100 

Cys Arg 
105 

25 

(2) INFORMATION FOR SEQ ID NO:1 5: 

(i) SEQUENCE CHARACTERISTICS: 



30 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 



Cys Xaa Xaa Xaa Xaa 
1 5 



45 

(2) INFORMATION FOR SEQ ID NO:16: 



(i) SEQUENCE CHARACTERISTICS: 



50 (A) LENGTH: 1 822 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOBO SAPIENS 
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5 



10 



15 



20 



25 



30 



35 



40 



(F) TISSUE TYPE: HIPPOCAHPUS 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 49..1341' 

(D) OTHER INFORMATION:/standard_name= "hOPI” 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 



GGTGCGGGCC CGGAGCCCGG AGCCCGGGTA GCGCGTAGAG CCGGCGCG ATG CAC GTG 

Met His Val 
1 

CGC TCA CTG CGA GCT GCG GCG CCG CAC AGC TTC GTG GCG CTC TGG GCA 

Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala 

5 10 15 

CCC CTG TTC CTG CTG CGC TCC GCC CTG GCC GAC TTC AGC CTG GAC AAC 

Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn 

20 25 30 35 

GAG GTG CAC TCG AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG 

Glu Val His Ser Ser Phe lie His Arg Arg Leu Arg Ser Gin Glu Arg 

40 45 50 

CGG GAG ATG CAG CGC GAG ATC CTC TCC ATT TTG GGC TTG CCC CAC CGC 

Arg Glu Met Gin Arg Glu He Leu Ser lie Leu Gly Leu Pro His Arg 

55 60 65 

CCG CGC CCG CAC CTC CAG GGC AAG CAC AAC TCG GCA CCC ATG TTC ATG 

Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro Met Phe Met 

70 75 80 

CTG GAC CTG TAC AAC GCC ATG GCG GTG GAG GAG GGC GGC GGG CCC GGC 

Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly Gly Pro Gly 

85 90 95 

GGC CAG GGC TTC TCC TAC CCC TAC AAG GCC GTC TTC AGT ACC CAG GGC 

Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr Gin Gly 

100 105 110 115 



45 



50 



55 



57 

105 

153 

201 

249 

297 

345 

393 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



ccc 


CCT 


CTG 


GCC 


AGC 


CTG 


CAA 


GAT 


AGC 


CAT 


TTC 


CTC 


ACC 


GAC 


GCC 


GAC 


441 


Pro 


Pro 


Leu 


Ala 


Ser 


Leu 


Gin 


Asp 


Ser 


His 


Phe 


Leu 


Thr 


Asp 


Ala 


Asp 












120 










125 








130 




ATG 


GTC 


ATG 


AGC 


TTC 


GTC 


AAC 


CTC 


GTG 


GAA 


CAT 


GAC 


AAG 


GAA 


TTC 


TTC 


489 


Met 


Val 


Met 


Ser 


Phe 


Val 


Asn 


Leu 


Val 


Glu 


His 


Asp 


Lys 


Glu 


Phe 


Phe 








135 










140 






145 








CAC 


CCA 


CGC 


TAC 


CAC 


CAT 


CGA 


GAG 


TTC 


CGG 


TTT 


GAT 


CTT 


TCC 


AAG 


ATC 


537 


His 


Pro 


Arg 


Tyr 


His 


His 


Arg 


Glu 


Phe 


Arg 


Phe 


Asp 


Leu 


Ser 


Lys 


lie 








150 










155 










160 








CCA 


GAA 


GGG 


GAA 


GCT 


GTC 


ACG 


GCA 


GCC 


GAA 


TTC 


CGG 


ATC 


TAC 


AAG 


GAC 


585 


Pro 


Glu 


Gly 


Glu 


Ala 


Val 


Thr 


Ala 


Ala 


Glu 


Phe 


Arg 


lie 


Tyr 


Lys 


Asp 






165 










170 










175 






TAC 


ATC 


CGG 


GAA 


CGC 


TTC 


GAC 


AAT 


GAG 


ACG 


TTC 


CGG 


ATC 


AGC 


GTT 


TAT 


633 


Tyr 


lie 


Arg 


Glu 


Arg 


Phe 


Asp 


Asn 


Glu 


Thr 


Phe 


Arg 


lie 


Ser 


Val 


Tyr 




180 










185 










190 








195 




CAG 


GTG 


CTC 


CAG 


GAG 


CAC 


TTG 


GGC 


AGG 


GAA 


TCG 


GAT 


CTC 


TTC 


CTG 


CTC 


681 


Gin 


Val 


Leu 


Gin 


Glu 


His 


Leu 


Gly 


Arg 


Glu 


Ser 


Asp 


Leu 


Phe 


Leu 


Leu 












200 










205 








210 






GAC 


AGC 


CGT 


ACC 


CTC 


TGG 


GCC 


TCG 


GAG 


GAG 


GGC 


TGG 


CTG 


GTG 


TTT 


GAC 


' 729 


Asp 


Set 


Arg 


Thr 


Leu 


Trp 


Ala 


Ser 


Glu 


Glu 


Gly 


Trp 


Leu 


Val 


Phe 


Asp 








215 










220 








225 






ATC 


ACA 


GCC 


ACC 


AGC 


AAC 


CAC 


TGG 


GTG 


GTC 


AAT 


CCG 


CGG 


CAC 


AAC 


CTG 


777 


He 


Thr 


Ala 


Thr 


Ser 


Asn 


His 


Trp 


Val 


Val 


Asn 


Pro 


Arg 


His 


Asn 


Leu 








230 










235 










240 










GGC 


CTG 


CAG 


CTC 


TCG 


GTG 


GAG 


ACG 


CTG 


GAT 


GGG 


CAG 


AGC 


ATC 


AAC 


CCC 


825 


Gly 


Leu 


Gin 


Leu 


Ser 


Val 


Glu 


Thr 


Leu 


Asp 


Gly 


Gin 


Ser 


lie 


Asn 


Pro 






245 










250 






255 












AAG 


ttc; 


GCG 


GGC 


CTG 


ATT 


GGG 


CGG 


CAC 


GGG 


CCC 


CAG 


AAC 


AAG 


CAG 


CCC 


873 


Lys 


Leu 


Ala 


Gly 


Leu 


lie 


Gly 


Arg 


His 


Gly 


Pro 


Gin 


Asn 


Lys 


Gin 


Pro 




260 










265 








270 








275 




TTC 


ATG 


GTG 


GCT 


TTC 


TTC 


AAG 


GCC 


ACG 


GAG 


GTC 


CAC 


TTC 


CGC 


AGC 


ATC 


921 


Phe 


Met: 


Val 


Ala 


Phe 


Phe 


Lys 


Ala 


Thr 


Glu 


Val 


His 


Phe 


Arg 


Ser 


lie 












280 










285 








290 






CGG 


TCC 


ACG 


GGG 


AGC 


AAA 


CAG 


CGC 


AGC 


CAG 


AAC 


CGC 


TCC 


AAG 


ACG 


CCC 


969 


Arg 


Ser 


Thr 


Gly 


Ser 


Lys 


Gin 


Arg 


Ser 


Gin 


Asn 


Arg 


Ser 


Lys 


Thr 


Pro 










295 










300 








305 








AAG 


AAC 


CAG 


GAA 


GCC 


CTG 


CGG 


ATG 


GCC 


AAC 


GTG 


GCA 


GAG 


AAC 


AGC 


AGC 


1017 


Lys 


Asn 


Gin 


Glu 


Ala 


Leu 


Arg 


Met 


Ala 


Asn 


Val 


Ala 


Glu 


Asn 


Ser 


Ser 





310 315 320 
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agc gac cag agg cag gcc tgt aag aag cac gag ctg tat gtc agc ttc 

Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe 

5 325 330 335 

CGA GAC CTG GGC TGG CAG GAC TGG ATC ATC GCG CCT GAA GGC TAC GCC 

Arg Asp Leu Gly Trp Gin Asp Trp He lie Ala Pro Glu Gly Tyr Ala 

340 . 345 350 355 

10 GCC TAC TAC TGT GAG GGG GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG 
Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met 
360 365 370 

AAC GCC ACC AAC CAC GCC ATC GTG CAG ACG CTG GTC CAC TTC ATC AAC 
15 Asn Ala Thr Asn His Ala lie Val Gin Thr Leu Val His Phe lie Asn 
375 380 385 

CCG GAA ACG GTG CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC 

Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala 

20 390 395 400 

ATC TCC GTC CTC TAC TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA 

lie Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val lie Leu Lys Lys 

405 410 415 

TAC AGA AAC ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC 
Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 



30 



35 



40 



420 4: 
GAGAATTCAG ACCCTTTGGG 
GAACCAGCAG ACCAACTGCC 
TGTGAGAGTA TTAGGAAACA 
ATCCAATGAA CAAGATCCTA 
GCATAAAGAA AAATGGCCGG 
CGTTTCCAGA GGTAATTATG 
GGCGTGGCAA GGGGTGGGCA 
CTGTAATAAA TGTCACAATA 



!5 430 
GCCAAGTTTT TCTGGATCCT 
TTTTGTGAGA CCTTCCCCTC 
TGAGCAGCAT ATGGCTTTTG 
CAAGCTGTGC AGGCAAAACC 
GCCAGGTCAT TGGCTGGGAA 
AGCGCCTACC AGCCAGGCCA 
CATTGGTGTC TGTGCGAAAG 
AAACGAATGA ATGAAAAAAA 



CCATTGCTCG CCTTGGCCAG 
CCTATCCCCA ACTTTAAAGG 
ATCAGTTTTT CAGTGGCAGC 
TAGCAGGAAA AAAAAACAAC 
GTCTCAGCCA TGCACGGACT 
CCCAGCCGTG GGAGGAAGGG 
GAAAATTGAC CCGGAAGTTC 
AAAAAAAAAA A 



45 

(2) INFORMATION FOR SEQ ID NO:1 7: 

(i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

55 (ix) FEATURE: 

(D) OTHER INFORMATION: /Product="OP1 -PP" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 



1065 

1113 

1161 

1209 

1257 

1305 

1351 

1411 

1471 

1531 

1591 

1651 

1711 

1771 

1822 
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Met His Val Arg Set Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 

1 5 10 15 

5 Leu Tip Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 

20 25 30 

Leu Asp Asn Glu Val His Ser Ser Phe lie His Arg Arg Leu Are Ser 
35 40 45 

w Gin Glu Arg Arg Glu Met Gin Arg Glu He Leu Ser lie Leu Gly Leu 

50 55 60 

Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 
65 70 75 80 

15 

Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly 

85 90 95 

Gly Pro Gly Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser 

20 100 105 110 

Thr Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr 
115 120 125 

25 Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys 

130 135 140 

Glu Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu 

1*5 150 155 160 

30 

Ser Lys lie Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg lie 
165 170 175 

Tyr Lys Asp Tyr lie Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg lie 
35 180 185 190 

Ser Val Tyr Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu 
195 200 205 

„„ Phe Leu Leu Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu 

40 210 215 220 

Val Phe Asp lie Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg 

225 230 235 240 

45 His Asn Leu Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser 

245 250 255 



50 



55 
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lie Asn Pro Lys Leu Ala Gly Leu lie Gly Arg His Gly Pro Gin Asn 

5 260 265 270 

Lys Gin Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe 

275 280 285 

Arg Ser lie Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser 

10 290 295 300 

Lys Thr Pro Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu 

305 310 315 320 

15 Asn Ser Ser Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr 

325 330 335 

Val Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp lie He Ala Pro Glu 
340 345 350 

20 

Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn 
355 360 365 

Ser Tyr Met Asn Ala Thr Asn His Ala lie Val Gin Thr Leu Val His 

25 370 375 380 

Phe lie Asn Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin 

385 390 395 400 

30 Leu Asn Ala lie Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val lie 

405 410 415 

Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 430 

35 

(2) INFORMATION FOR SEQ ID NO:1 8: 

(i) SEQUENCE CHARACTERISTICS: 

40 (A) LENGTH: 1 873 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 

50 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 104.. 1393 

55 (D) OTHER INFORMATION: /note= "MOP1 (CDNA)" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
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CTGCAGCAAG TGACCTCGGG TCGTGGACCG CTGCCCTGCC CCCTCCGCTG CCACCTGGGG 60 

cggcgcgggc CCGGTGCCCC GGATCGCGCG tagagccggc gcg ATG CAC GTG. CGC 115 

5 Met His Val Arg 

1 

TCG CT.rG CGC GCT GCG GCG CCA CAC AGC TTC GTG GCG CTC TGG GCG CCT 163 

Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala Pro 

io 5 10 15 20 

CTG TTC TTG CTG CGC TCC GCC CTG GCC GAT TTC AGC CTG GAC AAC GAG 211 

Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn Glu 

25 30 35 

15 

GTG CAC TCC AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG CGG 259 

Val His Ser Ser Phe He His Arg Arg Leu Arg Ser Gin Glu Arg Arg 

40 45 50 

20 GAG ATG CAG CGG GAG ATC CTG TCC ATC TTA GGG TTG CCC CAT CGC CCG 307 

Glu Met Gin Arg Glu lie Leu Ser lie Leu Gly Leu Pro His Arg Pro 
55 60 65 

CGC CCG CAC CTC CAG GGA AAG CAT AAT TCG GCG CCC ATG TTC ATG TTG 355 

„ Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro Met Phe Met Leu 
70 75 80 

GAC CTG TAC AAC GCC ATG GCG GTG GAG GAG AGC GGG CCG GAC GGA CAG 403 

Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Ser Gly Pro Asp Gly Gin 

30 85 90 95 100 

GGC TTC TCC TAC CCC TAC AAG GCC GTC TTC AGT ACC CAG GGC CCC CCT 451 

Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr Gin Gly Pro Pro 

105 110 115 

35 TTA GCC AGC CTG CAG GAC AGC CAT TTC CTC ACT GAC GCC GAC ATG GTC 499 

Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp Ala Asp Met Val 
120 125 130 

ATG AGC TTC GTC AAC CTA GTG GAA CAT GAC AAA GAA TTC TTC CAC CCT 547 

40 Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu Phe Phe His Pro 
135 140 145 

CGA TAC CAC CAT CGG GAG TTC CGG TTT GAT CTT TCC AAG ATC CCC GAG 595 

Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys lie Pro Glu 

45 150 155 160 

GGC GAA GCG GTG ACC GCA GCC GAA TTC AGG ATC TAT AAG GAC TAC ATC 643 

Gly Glu Ala Val Thr Ala Ala Glu Phe Arg He Tyr Lys Asp Tyr lie 

165 170 175 180 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



CGG 


GAG 


CGA 


TTT 


GAC 


AAC 


GAG 


ACC 


TTC 


CAG 


ATC 


ACA 


GTC 


TAT 


CAG 


TGG 


Arg 


Glu 


Arg 


Phe 


Asp 


Asn 


Glu 


Thr 


Phe 


Gin 


lie 


Thr 


Val 


Tyr 


Gin 


Trp 






185 










190 








195 




CTC 


CAG 


GAG 


CAC 


TCA 


GGC 


AGG 


GAG 


TCG 


GAC 


CTC 


TTC 


TTG 


CTG 


GAC 


AGC 


Leu 


Gin 


Glu 


His 


Ser 


Gly 


Arg 


Glu 


Ser 


Asp 


Leu 


Phe 


Leu 


Leu 


Asp 


Ser 








200 










205 










210 




CGC 


ACC 


ATC 


TGG 


GCT 


TCT 


GAG 


GAG 


GGC 


TGG 


TTG 


GTG 


TTT 


GAT 


ATC 


ACA 


Arg 


Thr 


lie 


Trp 


Ala 


Ser 


Glu 


Glu 


Gly 


Trp 


Leu 


Val 


Phe 


Asp 


lie 


Thr 




215 










220 










225 






GCC 


ACC 


AGC 


AAC 


CAC 


TGG 


GTG 


GTC 


AAC 


CCT 


CGG 


CAC 


AAC 


CTG 


GGC 


TTA 


Ala 


Thr 


Ser 


Asn 


His 


Trp 


Val 


Val 


Asn 


Pro 


Arg 


His 


Asn 


Leu 


Gly 


Leu 




230 










235 








240 








CAG 


CTC 


TCT 


GTG 


GAG 


ACC 


CTG 


GAT 


GGG 


CAG 


AGC 


ATC 


AAC 


CCC 


AAG 


TTG 


Gin 


Leu 


Ser 


Val 


Glu 


Thr 


Leu 


Asp 


Gly 


Gin 


Ser 


lie 


Asn 


Pro 


Lys 


Leu 


245 










250 










255 








260 


GCA 


GGC 


CTG 


ATT 


GGA 


CGG 


CAT 


GGA 


CCC 


CAG 


AAC 


AAG 


CAA 


CCC 


TTC 


ATG 


Ala 


Gly 


Leu 


He 


Gly 


Arg 


His 


Gly 


Pro 


Gin 


Asn 


Lys 


Gin 


Pro 


Phe 


Met 










265 










270 








275 




GTG 


GCC 


TTC 


TTC 


AAG 


GCC 


ACG 


GAA 


GTC 


CAT 


CTC 


CGT 


AGT 


ATC 


CGG 


TCC 


Val 


Ala 


Phe 


Phe 


Lys 


Ala 


Thr 


Glu 


Val 


His 


Leu 


Arg 


Ser 


lie 


Arg 


Ser 








280 










285 








290 




ACG 


GGG 


GGC 


AAG 


CAG 


CGC 


AGC 


CAG 


AAT 


CGC 


TCC 


AAG 


ACG 


CCA 


AAG 


AAC 


Thr 


Gly 


Gly 


Lys 


Gin 


Arg 


Ser 


Gin 


Asn 


Arg 


Ser 


Lys 


Thr 


Pro 


Lys 


Asn 






295 










300 






305 






CAA 


GAG 


GCC 


CTG 


AGG 


ATG 


GCC 


AGT 


GTG 


GCA 


GAA 


AAC 


AGC 


AGC 


AGT 


GAC 


Gin 


Glu 


Ala 


Leu 


Arg 


Net 


Ala 


Ser 


Val 


Ala 


Glu 


Asn 


Ser 


Ser 


Ser 


Asp 




310 










315 










320 








CAG 


AGG 


CAG 


GCC 


TGC 


AAG 


AAA 


CAT 


GAG 


CTG 


TAC 


GTC 


AGC 


TTC 


CGA 


GAC 


Gin 


Arg 


Gin 


Ala 


Cys 


Lys 


Lys 


His 


Glu 


Leu 


Tyr 


Val 


Ser 


Phe 


Arg 


Asp 


325 










330 










335 








340 


CTT 


GGC 


TGG 


CAG 


GAC 


TGG 


ATC 


ATT 


GCA 


CCT 


GAA 


GGC 


TAT 


GCT 


GCC 


TAC 


Leu 


Gly 


Trp 


Gin 


Asp 


Trp 


lie 


He 


Ala 


Pro 


Glu 


Gly 


iyr 


Ala 


Ala 


Tyr 










345 










350 










355 


TAC 


TGT 


GAG 


GGA 


GAG 


TGC 


GCC 


TTC 


CCT 


CTG 


AAC 


TCC 


TAC 


ATG 


AAC 


GCC 


Tyr 


Cys 


Glu 


Gly 


Glu 


Cys 


Ala 


Phe 


Pro 


Leu 


Asn 


Ser 


Tyr 


Met 


Asn 


Ala 






360 










365 








370 






ACC 


AAC 


CAC 


GCC 


ATC 


GTC 


CAG 


ACA 


CTG 


GTT 


CAC 


TTC 


ATC 


AAC 


CCA 


GAC 


Thr 


Asn 


His 


Ala 


lie 


Val 


Gin 


Thr 


Leu 


Val 


His 


Phe 


lie 


Asn 


Pro 


Asp 






375 










380 










385 







691 

739 

787 

835 

883 

931 

979 

1027 

1075 

1123 

1171 

1219 

1267 
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ACA 


GTA 


CCC 


AAG 


CCC 


TGC 


TGT 


GCG 


CCC 


ACC 


CAG 


CTC AAC GCC ATC TCT 


1315 


Thr 


Val 


Pro 


Lys 


Pro 


Cys 


Cys 


Ala 


Pro 


Thr 


Gin 


Leu Asn Ala lie Ser 






390 








395 










400 




GTC 


CTC 


TAC 


TTC 


GAC 


GAC 


AGC 


TCT 


AAT 


GTC 


ATC 


CTG AAG AAG TAC AGA 


1363 


Val 


Leu 


Tyr 


Phe 


Asp 


Asp 


Ser 


Ser 


Asn 


Val 


He 


Leu Lys Lys Tyr Arg 




405 










410 










415 


420 




AAC 


ATG 


GTG 


GTC 


CGG 


GCC 


TGT 


GGC 


TGC 


CAC 


TAGCTCTTCC TGAGACCCTG 


1413 


Asn 


Met 


Val 


Val 


Arg 


Ala 


Cys 


Gly 


Cys 


His 
















425 








430 









ACCTTTGCGG GGCCACACCT TTCCAAATCT TCGATGTCTC ACCATCTAAG TCTCTCACTG 1473 
CCCACCXTGG CGAGGAGAAC AGACCAACCT CTCCTGAGCC TTCCCTCACC TCCCAACCGG 1533 
AAGCATGTAA GGGTTCCAGA AACCTGAGCG TGCAGCAGCT GATGAGCGCC CTTTCCTTCT 1593 
GGCACGTGAC GGACAAGATC CTACCAGCTA CCACAGCAAA CGCCTAAGAG CAGGAAAAAT 1653 
GTCTGCCAGG AAAGTGTCCA GTGTCCACAT GGCCCCTGGC GCTCTGAGTC TTTGAGGAGT 1713 
AATCGCMGC CTCGTTCAGC TGCAGCAGAA GGAAGGGCTT AGCCAGGGTG GGCGCTGGCG 1773 
TCTGTGCTGA AGGGAAACCA AGCAGAAGCC ACTGTAATGA TATGTCACAA TAAAACCCAT 1833 
GAATGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAGAATTC 1873 



30 (2) INFORMATION FOR SEQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

40 (D) OTHER INFORMATION: /product= "mOPI-PP" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 



Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Set . Phe Val Ala 
« 15 10 15' 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 
20 25 30 

so Leu Asp Asn Glu Val His Ser Ser Phe lie His Arg Arg Leu Arg Ser 

35 40 45 



55 



53 
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Gin 


Glu 


Arg 


Arg 


Glu 


Met 


Gin 


Arg 


Glu 


He 


Leu 


Ser 


lie 


Leu 


Gly 


Leu 






50 






55 










60 










5 


Pro 


His 


Arg 


Pro 


Arg 


Pro 


His 


Leu 


Gin 


Gly 


Lys 


His 


Asn 


Ser 


Ala 


Pro 




65 






70 










75 










80 




Met 


Phe 


Met 


Leu 


Asp 


Leu 


Tyr 


Asn 


Ala 


Met 


Ala 


Val 


Glu 


Glu 


Ser 


Gly 


10 










85 








90 










95 




Pro 


Asp 


Gly 


Gin 


Gly 


Phe 


Ser 


Tyr 


Pro 


Tyr 


Lys 


Ala 


Val 


Phe 


Ser 


Thr 








100 










105 










110 








Gin 


Gly 


Pro 


Pro 


Leu 


Ala 


Ser 


Leu 


Gin 


Asp 


Ser 


His 


Phe 


Leu 


Thr 


Asp 


15 




115 










120 








125 








Ala 


Asp 


Met 


Val 


Met 


Ser 


Phe 


Val 


Asn 


Leu 


Val 


Glu 


His 


Asp 


Lys 


Glu 






130 










135 










140 






20 


Phe 


Phe 


His 


Pro 


Arg 


Tyr 


His 


His 


Arg 


Glu 


Phe 


Arg 


Phe 


Asp 


Leu 


Ser 




145 










150 








155 






160 




Lys 


He 


Pro 


Glu 


Gly 


Glu 


Ala 


Val 


Thr 


Ala 


Ala 


Glu 


Phe 


Arg 


lie 


Tyr 


25 








165 










170 








175 


Lys 


Asp 


Tyr 


lie 


Arg 


Glu 


Arg 


Phe 


Asp 


Asn 


Glu 


Thr 


Phe 


Gin 


lie 


Thr 








180 








185 










190 








Val 


Tyr 


Gin 


Trp 


Leu 


Gin 


Glu 


His 


Ser 


Gly 


Arg 


Glu 


Ser 


Asp 


Leu 


Phe 


30 






195 










200 








205 








Leu 


Leu 


Asp 


Ser 


Arg 


Thr 


lie 


Trp 


Ala 


Ser 


Glu 


Glu 


Gly 


Trp 


Leu 


Val 






210 










215 










220 






35 


Phe 


Asp 


lie 


Thr 


Ala 


Thr 


Ser 


Asn 


His 


Trp 


Val 


Val 


Asn 


Pro 


Arg 


His 




225 










230 








235 








240 




Asn 


Leu 


Gly 


Leu 


Gin 


Leu 


Ser 


Val 


Glu 


Thr 


Leu 


Asp 


Gly 


Gin 


Ser 


lie 












245 










250 






255 




40 


Asn 


Pro 


Lys 


Leu 


Ala 


Gly 


Leu 


lie 


Gly 


Arg 


His 


Gly 


Pro 


Gin 


Asn 


Lys 










260 










265 










270 






Gin 


Pro 


Phe 


Met 


Val 


Ala 


Phe 


Phe 


Lys 


Ala 


Thr 


Glu 


Val 


His 


Leu 


Arg 


45 






275 










280 








285 








Ser 


lie 


Arg 


Ser 


Thr 


Gly 


Gly 


Lys 


Gin 


Arg 


Ser 


Gin 


Asn 


Arg 


Ser 


Lys 






290 










295 










300 








Thr 


Pro 


Lys 


Asn 


Gin 


Glu 


Ala 


Leu 


Arg 


Met 


Ala 


Ser 


Val 


Ala 


Glu 


Asn 


50 


305 










310 








315 










320 




Ser 


Ser 


Ser 


Asp 


Gin 


Arg 


Gin 


Ala 


Cys 


Lys 


Lys 


His 


Glu 


Leu 


Tyr 


Val 



325 330 335 

55 
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Ser 


Phe 


Arg 


Asp 


Leu 


Gly 


Trp 


Gin 


Asp 


Trp 


He 


lie 


Ala 


Pro 


Glu 


Gly 






340 










345 










350 






Tyr 


Ala 


Ala 


Tyr 


Tyr 


Cys 


Glu 


Gly 


Glu 


Cys 


Ala 


Phe 


Pro 


Leu 


Asn 


Ser 




355 










360 










365 








Tyr 


Met 


Asn 


Ala 


Thr 


Asn 


His 


Ala 


lie 


Val 


Gin 


Thr 


Leu 


Val 


His 


Phe 


370 










375 










380 










lie 


Asn 


Pro 


Asp 


Thr 


Val 


Pro 


Lys 


Pro 


Cys 


Cys 


Ala 


Pro 


Thr 


Gin 


Leu 


385 










390 










395 






400 




Asn 


Ala 


lie 


Ser 


Val 


Leu 


Tyr 


Phe 


Asp 


Asp 


Ser 


Ser 


Asn 


Val 


He 


Leu 










405 










410 










415 




Lys 


Lys 


Tyr 


Arg 


Asn 


Met 


Val 


Val 


Arg 


Ala 


Cys 


Gly 


Cys 


His 










420 










425 






430 







20 (2) INFORMATION FOR SEQ ID NO:20: 

. (i)SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1723 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: HIPPOCAMPUS 

35 (ix)FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 490..1696 

(D) OTHER INFORMATION: /note= "hOP2 (cDNA)” 

40 

(xi)SEQUENCE DESCRIPTION: SEQ ID NO:20: 



GGCGCCGGCA GAGCAGGAGT GGCTGGAGGA 

45 

GGGCTGGAGG GCTCCCTATG AGTGGCGGAG 
CCACACCGCA CCAAGCGGTG GCTGCAGGAG 
so GCGGCC ACAG CCGGACTGGC GGGTACGGCG 

CCGDvGAGTA GCCCCGGCCT CGAGGCGGTG 
GACAGGTGTC GCGCGGCGGG GCTCCAGGGA 
CGCCCCGCCC CGCCGCCCGC CGCCCGCCGA 



GCTGTGGTTG GAGCAGGAGG 


TGGCACGGCA 


60 


ACGGCCCAGG AGGCGCTGGA 


GCAACAGCTC 


120 


CTCGCCCATC GCCCCTGCGC 


TGCTCGGACC 


180 


GCGACAGAGG CATTGGCCGA 


GAGTCCCAGT 


240 


GCGTCCCGGT CCTCTCCGTC 


CAGGAGCCAG 


300 


CCGCGCCTGA GGCCGGCTGC 


CCGCCCGTCC 


360 


GCCCAGCCTC CTTGCCGTCG 


GGGCGTCCCC 


420 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



AGGCCCTGGG TCGGCCGCGG AGCCGATGCG CGCCCGCTGA GCGCCCCAGC TGAGCGCCCC 

CGGCCTGCC ATG ACC GCG CTC CCC GGC CCG CTC TGG CTC CTG GGC CTG 
Met Thr Ala Leu Pro Gly Pro Leu Trp Leu Leu Gly Leu 
1 5 10 

GCG CTA TGC GCG CTG GGC GGG GGC GGC CCC GGC CTG CGA CCC CCG CCC 

Ala Leu Cys Ala Leu Gly Gly Gly Gly Pro Gly Leu Arc Pro Pro Pro 

15 20 25 

GGC TGT CCC CAG CGA CGT CTG GGC GCG CGC GAG CGC CGG GAC GTG CAG 

Gly Cys Pro Gin Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Val Gin 

30 35 40 45 

CGC GAG ATC CTG GCG GTG CTC GGG CTG CCT GGG CGG CCC CGG CCC CGC 

Arg Glu lie Leu Ala Val Leu Gly Leu Pro Gly Arg Pro Arg Pro Arg 

50 55 60 

GCG CCA CCC GCC GCC TCC CGG CTG CCC GCG TCC GCG CCG CTC TTC ATG 

Ala Pro Pro Ala Ala Ser Arg Leu Pro Ala Ser Ala Pro Leu Phe Met 

65 70 75 

CTG GAC CTG TAC CAC GCC ATG GCC GGC GAC GAC GAC GAG GAC GGC GCG 

Leu Asp Leu Tyr His Ala Met Ala Gly Asp Asp Asp Glu Asp Gly Ala 

80 85 90 

CCC GCG GAG CGG CGC CTG GGC CGC GCC GAC CTG GTC ATG AGC TTC GTT 

Pro Ala Glu Arg Arg Leu Gly Arg Ala Asp Leu Val Met Ser Phe Val 

95 100 105 

AAC ATG GTG GAG CGA GAC CGT GCC CTG GGC CAC CAG GAG CCC CAT TGG 

Asn Met Val Glu Arg Asp Arg Ala Leu Gly His Gin Glu Pro His Trp 

110 115 120 125 

AAG GAG TTC CGC TTT GAC CTG ACC CAG ATC CCG GCT GGG GAG GCG GTC 

Lys Glu Phe Arg Phe Asp Leu Thr Gin He Pro Ala Gly Glu Ala Val 

130 135 140 

ACA GCT GCG GAG TTC CGG ATT TAC AAG GTG CCC AGC ATC CAC CTG CTC 

Thr Ala Ala Glu Phe Arg lie Tyr Lys Val Pro Ser lie His Leu Leu 

145 150 155 

AAC AGG ACC CTC CAC GTC AGC ATG TTC CAG GTG GTC CAG GAG CAG TCC 

Asn Arg Thr Leu His Val Ser Met Phe Gin Val Val Gin Glu Gin Ser 

160 165 170 

AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT CTT CAG ACG CTC CGA GCT 

Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gin Thr Leu Arg Ala 

175 180 185 

GGA GAC GAG GGC TGG CTG GTG CTG GAT GTC ACA GCA GCC AGT GAC TGC 

Gly Asp Glu Gly Trp Leu Val Leu Asp Val Thr Ala Ala Ser Asp Cys 

190 195 200 205 



55 



480 

528 

576 

624 

672 

720 

768 

816 

864 

912 

960 

1008 

1056 

1104 
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5 



10 



IS 



20 



25 



30 



35 



40 



45 



50 



55 



TGG TTG CTG AAG CGT CAC AAG GAC CTG GGA CTC CGC CTC TAT GTG GAG 

Trp Leu Leu Lys Arg His Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu 

210 215 220 

ACT GAG GAC GGG CAC AGC GTG GAT CCT GGC CTG GCC GGC CTG CTG GGT 

Thr Glu Asp Gly His Ser Val Asp Pro Gly Leu Ala Gly Leu Leu Gly 

225 230 235 

CAA CGG GCC CCA CGC TCC CAA CAG CCT TTC GTG GTC ACT TTC TTC AGG 

Gin Arg Ala Pro Arg Ser Gin Gin Pro Phe Val Val Thr Phe Phe Arg 

240 245 250 

GCC AGT CCG AGT CCC ATC CGC ACC CCT CGG GCA GTG AGG CCA CTG AGG 

Ala Ser Pro Ser Pro He Arg Thr Pro Arg Ala Val Arg Pro Leu Are 

255 260 265 

AGG AGG CAG CCG AAG AAA AGC AAC GAG CTG CCG CAG GCC AAC CGA CTC 

Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu Pro Gin Ala Asn Are Leu 

270 275 280 285 

CCA GGG ATC TTT GAT GAC GTC CAC GGC TCC CAC GGC CGG CAG GTC TGC 

Pro Gly lie Phe Asp Asp Val His Gly Ser His Gly Arg Gin Val Cys 

290 295 300 

CGT CGG CAC GAG CTC TAC GTC AGC TTC CAG GAC CTC GGC TGG CTG GAC 

Arg Ar,g His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu Asp 

305 310 315 

TGG GTC ATC GCT CCC CAA GGC TAC TCG GCC TAT TAC TGT GAG GGG GAG 

Trp Val lie Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu 

320 325 330 

TGC TCC TTC CCA CTG GAC TCC TGC ATG AAT GCC ACC AAC CAC GCC ATC 

Cys Ser Phe Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala lie 

335 340 345 

CTG CAG TCC CTG GTG CAC CTG ATG AAG CCA AAC GCA GTC CCC AAG GCG 

Leu Gin Ser Leu Val His Leu Met Lys Pro Asn Ala Val Pro Lys Ala 

350 355 360 365 

TGC TGT GCA CCC ACC AAG CTG AGC GCC ACC TCT GTG CTC TAC TAT GAC 

Cys Cyj; Ala Pro Thr Lys Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asd 

370 375 380 

AGC AGC AAC AAC GTC ATC CTG CGC AAA CAC CGC AAC ATG GTG GTC AAG 

Ser Ser Asn Asn Val lie Leu Arg Lys His Arg Asn Met Val Val Lys 

385 390 395 

GCC TGC GGC TGC CAC T GAGTCAGCCC GCCCAGCCCT ACTGCAG 
Ala Cys Gly Cys His 
400 



(2) INFORMATION FOR SEQ ID NO:21: 
(i)SEQUENCE CHARACTERISTICS: 



1152 

1200 

1248 

1296 

1344 

1392 

1440 

1488 

1536 

1584 

1632 

1680 

1723 
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(A) LENGTH: 402 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (ii)MOLECULE TYPE: protein 

(ix)FEATURE: 

(A)OTHER INFORMATION: /product= n hOP2-PP" 

(xi)SEQUENCE DESCRIPTION: SEQ ID NO:21: 

10 

Met Thr Ala Leu Pro Gly Pro Leu Trp Leu Leu Gly Leu Ala Leu Cys 
1 5 10 15 

Ala Leu Gly Gly Gly Gly Pro Gly Leu Arg Pro Pro Pro Gly Cys Pro 

is 20 25 30 

Gin Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Val Gin Arg Glu lie 

35 40 45 

20 Leu Ala Val Leu Gly Leu Pro Gly Arg Pro Arg Pro Arg Ala Pro Pro 

50 55 60 

Ala Ala Ser Arg Leu Pro Ala Ser Ala Pro Leu Phe Met Leu Asp Leu 

65 70 75 80 

25 

Tyr His Ala Met Ala Gly Asp Asp Asp Glu Asp Gly Ala Pro Ala Glu 
85 90 95 

Arg Arg Leu Gly Arg Ala Asp Leu Val Met Ser Phe Val Asn Met Val 

30 ioo 105 no 

Glu Arg Asp Arg Ala Leu Gly His Gin Glu Pro His Trp Lys Glu Phe 

115 120 125 

Arg Phe Asp Leu Thr Gin lie Pro Ala Gly Glu Ala Val Thr Ala Ala 

130 135 \ 140 

Glu Phe Arg. lie Tyr Lys Val Pro Ser lie His Leu Leu Asn Are Thr 

145 150 155 160 

40 Leu His Val Ser Met Phe Gin Val Val Gin Glu Gin Ser Asn Arg Glu 

165 170 175 

Ser Asp Leu Phe Phe Leu Asp Leu Gin Thr Leu Arg Ala Gly Asp Glu 
180 185 190 

45 

Gly Trp Leu Val Leu Asp Val Thr Ala Ala Ser Asp Cys Trp Leu Leu 

195 200 205 



50 



55 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Lys Arg His Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu Thr Glu Asp 
210 215 220 

Gly His Ser Val Asp Pro Gly Leu Ala Gly Leu Leu Gly Gin Arg Ala 

225 230 235 240 

Pro Arg Ser Gin Gin Pro Phe Val Val Thr Phe Phe Arg Ala Ser Pro 

245 250 255 

Ser Pro He Arg Thr Pro Arg Ala Val Arg Pro Leu Arg Are Are Gin 
260 265 270 

Pro Lys Lys Ser Asn Glu Leu Pro Gin Ala Asn Arg Leu Pro Gly lie 
275 280 285 

Phe Asp Asp Val His Gly Ser His Gly Arg Gin Val Cys Arg Arg His 
290 295 300 

Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu Asp Trp Val lie 

305 310 315 320 

Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ser Phe 

325 330 335 

Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala lie Leu Gin Ser 
340 345 350 

Leu Val His Leu Met Lys Pro Asn Ala Val Pro Lys Ala Cys Cys Ala 
355 360 365 

Pro Thr Lys Leu Ser Ala Thr Ser Val Leu Tyr IVr Asp Ser Ser Asn 
370 375 380 

Asn Val lie Leu Arg Lys His Arg Asn Met Val Val Lys Ala Cys Gly 

385 390 395 400 

Cys His 



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 93.. 1289 



59 
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to 



15 



20 



25 



30 



35 



40 



45 



50 



(D) OTHER INFORMATION: /note= "mOP2 cDNA” 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 



GCCAGGCACA GGTGCGCCGT CTGGTCCTCC CCGTCTGGCG TCAGCCGAGC 

CCGACCAGCT ACCAGTGGAT GCGCGCCGGC TGAAAGTCCG AG ATG GCT ATG CGT 

Met Ala Met Arg 
1 



ccc 


GGG 


CCA 


CTC 


TGG 


CTA 


TTG 


GGC 


CTT 


GCT 


CTG 


TGC 


GCG 


CTG 


GGA 


GGC 


Pro 


Gly 


Pro 


Leu 


Trp 


Leu 


Leu 


Gly 


Leu 


Ala 


Leu 


Cys 


Ala 


Leu 


Gly 


Gly 


5 










10 










15 








20 


GGC 


CAC 


GGT 


CCG 


CGT 


CCC 


CCG 


CAC 


ACC 


TGT 


CCC 


CAG 


CGT 


CGC 


CTG 


GGA 


Gly 


His 


Gly 


Pro 


Arg 


Pro 


Pro 


His 


Thr 


Cys 


Pro 


Gin 


Arg 


Arg 


Leu 


Gly 










25 










30 






35 


GCG 


CGC 


GAG 


CGC 


CGC 


GAC 


ATG 


CAG 


CGT 


GAA 


ATC 


CTG 


GCG 


GTG 


CTC 


GGG 


Ala 


Arg 


Glu 


Arg 


Arg 


Asp 


Met 


Gin 


Arg 


Glu 


He 


Leu 


Pro 


Val 


Leu 


Gly 








40 










45 










50 




CTA 


CCG 


GGA 


CGG 


CCC 


CGA 


CCC 


CGT 


GCA 


CAA 


CCC 


GCG 


GCT 


GCC 


CGG 


CAG 


Leu 


Pro 


Gly 


Arg 


Pro 


Arg 


Pro 


Arg 


Ala 


Gin 


Pro 


Ala 


Ala 


Ala 


Arg 


Gin 






55 










60 










65 






CCA 


GCG 


TCC 


GCG 


CCC 


CTC 


TTC 


ATG 


TTG 


GAC 


CTA 


TAC 


CAC 


GCC 


ATG 


ACC 


Pro 


Ala 


Ser 


Ala 


Pro 


Leu 


Phe 


Met 


Leu 


Asp 


Leu 


Tyr 


His 


Ala 


Met 


Thr 




70 










75 








80 










GAT 


GAC 


GAC 


GAC 


GGC 


GGG 


CCA 


CCA 


CAG 


GCT 


CAC 


TTA 


GGC 


CGT 


GCC 


GAC 


Asp 


Asp 


Asp 


Asp 


Gly 


Gly 


Pro 


Pro 


Gin 


Ala 


His 


Leu 


Gly 


Arg 


Ala 


Asp 


85 










90 










95 






100 


CTG 


GTC 


ATG 


AGC 


TTC 


GTC 


AAC 


ATG 


GTG 


GAA 


CGC 


GAC 


CGT 


ACC 


CTG 


GGC 


Leu 


Val 


Met 


Ser 


Phe 


Val 


Asn 


Met 


Val 


Glu 


Arg 


Asp 


Arg- 


Thr 


Leu 


Gly 










105 










110 








115 


TAC 


CAG 


GAG 


CCA 


CAC 


TGG 


AAG 


GAA 


TTC 


CAC 


TTT 


GAC 


CTA 


ACC 


CAG 


ATC 


Tyr 


Gin 


Glu 


Pro 


His 


Trp 


Lys 


Glu 


Phe 


His 


Phe 


Asp 


Leu 


Thr 


Gin 


lie 








120 










125 








130 






CCT 


GCT 


GGG 


GAG 


GCT 


GTC 


ACA 


GCT 


GCT 


GAG 


TTC 


CGG 


ATC 


TAC 


AAA 


GAA 


Pro 


Ala 


Gly 


Glu 


Ala 


Val 


Thr 


Ala 


Ala 


Glu 


Phe 


Arg 


lie 


Tyr 


Lys 


Glu 






135 










140 








145 




CCC 


AGC 


ACC 


CAC 


CCG 


CTC 


AAC 


ACA 


ACC 


CTC 


CAC 


ATC 


AGC 


ATG 


TTC 


GAA 


Pro 


Ser 


Thr 


His 


Pro 


Leu 


Asn 


Thr 


Thr 


Leu 


His 


lie 


Ser 


Met 


Phe 


Glu 




150 










155 










160 









55 



50 

104 

152 

200 

248 

296 

344 

392 

440 

488 

536 

584 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



SO 



55 



GTG GTC CAA GAG CAC TCC AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT 
Val Veil Gin Glu His Ser Asn Arg Glu Ser Asp Leu Pbe Phe Leu Asp 
165 170 175 180 

CTT CAG ACG CTC CGA TCT GGG GAC GAG GGC TGG CTG GTG CTG GAC ATC 
Leu Gin Thr Leu Arg Ser Gly Asp Glu Gly Trp Leu Val Leu Asp lie 
185 190 195 

ACA GCA GCC AGT GAC CGA TGG CTG CTG AAC CAT CAC AAG GAC CTG GGA 
Thr Ala Ala Ser Asp Arg Trp Leu Leu Asn His His Lys Asp Leu Gly 
200 205 210 

CTC CGC CTC TAT GTG GAA ACC GCG GAT GGG CAC AGC ATG GAT CCT GGC 
Leu Arg Leu Tyr Val Glu Thr Ala Asp Gly His Ser Het Asp Pro Gly 
215 220 225 

CTG GCT GGT CTG CTT GGA CGA CAA GCA CCA CGC TCC AGA CAG CCT TTC 
Leu Ala Gly Leu Leu Gly Arg Gin Ala Pro Arg Ser Arg Gin Pro Phe 
230 235 240 

ATG GTA ACC TTC TTC AGG GCC AGC CAG AGT CCT GTG CGG GCC CCT CGG 
Het Val Thr Phe Phe Arg Ala Ser Gin Ser Pro Val Arg Ala Pro Arg 
245 250 255 260 

GCA GCG AGA CCA CTG AAG AGG AGG CAG CCA AAG AAA ACG AAC GAG CTT 
Ala Ala Arg Pro Leu Lys Arg Arg Gin Pro Lys Lys Thr Asn Glu Leu 
265 270 275 

CCG CAC CCC AAC AAA CTC CCA GGG ATC TTT GAT GAT GGC CAC GGT TCC 
Pro His Pro Asn Lys Leu Pro Gly He Phe Asp Asp Gly His Gly Ser 
280 285 290 

CGC GGC AGA GAG GTT TGC CGC AGG CAT GAG CTC TAC GTC AGC TTC CGT 
Arg Gly Arg Glu Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Are 
295 300 305 

GAC CTr GGC TGG CTG GAC TGG GTC ATC GCC CCC CAG GGC TAC TCT GCC 
Asp Leu Gly Trp Leu Asp Trp Val lie Ala Pro Gin Gly Tyr Ser Ala 
310 315 320 

TAT TAC TGT GAG GGG GAG TGT GCT TTC CCA CTG GAC TCC TGT ATG AAC 
Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asp Ser Cys Met Asn 
325 330 335 340 

GCC ACC AAC CAT GCC ATC TTG CAG TCT CTG GTG CAC CTG ATG AAG CCA 
Ala Thr Asn His Ala lie Leu Gin Ser Leu Val His Leu Met Lys Pro 
345 350 355 

GAT GTT GTC CCC AAG GCA TGC TGT GCA CCC ACC AAA CTG AGT GCC ACC 
Asp Va! Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr 
360 365 370 



632 
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872 

920 
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1016 

1064 
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TCT GTG CTG TAC TAT GAC AGC AGC AAC AAT GTC ATC CTG CGT AAA CAC 1256 

Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val lie Leu Are Lys His 

375 380 385 

CGT AAC ATG GTG GTC AAG GCC TGT GGC TGC CAC TGAGGCCCCG CCCAGCATCC 1309 

Arg Asn Met Val Val Lys Ala Cys Gly Cys His 

390 395 



TGCTTCTACT ACCTTACCAT CTGGCCGGGC 
TATCATAGCT CAGACAGGGG CAATGGGAGG 
is AAATTCTGGT CTTTCCCAGT TCCTCTGTCC 

CTCTCCATCC TCCTACCCCA AGCATAGACT 
ACTGAGAGGT CTGGGGTCAG CACTGAAGGC 

20 

CTCAGCCCAC AATGGCAAAT TCTGGATGGT 
GATCTGGGCT CTCTGCACCA TTCATTGTGG 
CATACACTTA GATCAATGCA TCGCTGTACT 

25 

AGAATCAGAG CCAGGTATAG CGGTGCATGT 
AGGAGAATCT CTGTGAGTTC AAGGCCACAT 
30 AAAAAAAAAC GGAATTC 

(2) INFORMATION FOR SEQ ID NO:23: 

35 (i) SEQUENCE CHARACTERISTICS: 



CCCTCTCCAG AGGCAGAAAC 


CCTTCTATGT 


1369 


CCCTTCACTT CCCCTGGCCA 


CTTCCTGCTA 


1429 


TTCATGGGGT TTCGGGGCTA 


TCACCCCGCC 


1489 


GAATGCACAC AGCATCCCAG 


AGCTATGCTA 


1549 


CCACATGAGG AAGACTGATC 


CTTGGCCATC 


1609 


CTAAGAAGGC CGTGGAATTC 


TAAACTAGAT 


1669 


CAGTTGGGAC ATTTTTAGGT 


ATAACAGACA 


1729 


CCTTGAAATC AGAGCTAGCT 


TGTTAGAAAA 


1789 


CATTAATCCC AGCGCTAAAG 


AGACAGAGAC 


1849 


AGAAAGAGCC TGTCTCGGGA 


GCAGGAAAAA 


1909 






1926 



(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(D) OTHER INFORMATION: /product= “mOP2-PP” 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

45 



Met Ala Met Arg Pro Gly Pro Leu Trp Leu Leu Gly Leu Ala Leu Cys 

1 5 10 15 

Ala Leu Gly Gly Gly His Gly Pro Arg Pro Pro His Thr Cys Pro Gin 

20 25 30 

Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Met Gin Arg Glu He Leu Ala 
35 40 45 
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Val Leu Gly Leu Pro Gly Arg Pro Arg Pro Arg Ala Gin Pro Ala Ala 

50 55 60 65 

Ala Arg Gin Pro Ala Ser Ala Pro Leu Phe Met Leu Asp Leu Tyr His Ala 

70 75 80 

Met Thr Asp Asp Asp Asp Gly Gly Pro Pro Gin Ala His Leu Gly Arg 
85 90 95 

Ala Asp Leu Val Met Ser Phe Val Asn Met Val Glu Arg Asp Arg Thr 
100 105 110 

Leu Gly Tyr Gin Glu Pro His Trp Lys Glu Phe His Phe Asp Leu Thr 

115 120 125 130 

Gin lie IPro Ala Gly Glu Ala Val Thr Ala Ala Glu Phe Arg lie Tyr 

135 140 145 

Lys Glu Pro Ser Thr His Pro Leu Asn Thr Thr Leu His lie Ser Met 
150 155 160 

Phe Glu Val Val Gin Glu His Ser Asn Arg Glu Ser Asp Leu Phe Phe 
165 170 175 

Leu Asp Leu Gin Thr Leu Arg Ser Gly Asp Glu Gly Trp Leu Val Leu 
180 185 190 

Asp lie Thr Ala Ala Ser Asp Arg Trp Leu Leu Asn His His Lys Asp 

195 200 205 210 

Leu Gly Leu Arg Leu Tyr Val Glu Thr Ala Asp Gly His Ser Met Asp 

215 220 225 

Pro Gly leu Ala Gly Leu Leu Gly Arg Gin Ala Pro Arg Ser Arg Gin 
230 235 240 

Pro Phe Met Val Thr Phe Phe Arg Ala Ser Gin Ser Pro Val Arg Ala 
245 250 255 

Pro Arg Ala Ala Arg Pro Leu Lys Arg Arg Gin Pro Lys Lys Thr Asn 
260 265 270 

Glu Leu Pro His Pro Asn Lys Leu Pro Gly lie Phe Asp Asp Gly His 

275 280 285 290 

Gly Ser Arg Gly Arg Glu Val Cys Arg Arg His Glu Leu Tyr Val Ser 

295 300 305 

Phe Arg Asp Leu Gly Trp Leu Asp Trp Val lie Ala Pro Gin Gly Tyr 
310 315 320 
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Ser 


Ala 


Tyr 

325 


Tyr 


Cys 


Glu 




Net 


Asn 

340 


Ala 


Thr 


Asn 


His 


10 


Lys 

355 


Pro 


Asp 


Val 


Val 


Pro 

360 




Ala 


Thr 


Ser 


Val 


Leu 

375 


Tyr 


15 


Lys 


His 


Arg 


Asn 

390 


Met 


Val 



20 

Claims 



Gly 


Glu 


Cys 


Ala 


Phe 


Pro 


Leu Asp 


Ser 


Cys 




330 










335 






Ala 


He 


Leu 


Gin 


Ser 


Leu 


Val His 


Leu 


Met 


345 










350 








Lys 


Ala 


Cys 


Cys 


Ala 


Pro 


Thr Lys 


Leu 


Ser 










365 






370 


Tyr 


Asp 


Ser 


Ser 


Asn 


Asn 


Val lie 


Leu 


Arg 








380 








385 


Val 


Lys 


Ala 


Cys 


Gly 


Cys 


His 










395 











1. A composition comprising: (1) a biocompatible acellular matrix that provides an anchoring substratum for mam- 
malian progenitor ceils and is essentially free of substances that inhibit morphogenesis; and (b) a morphogen 
sorbed on said matrix at a concentration sufficient for inducing the developmental cascade of tissue-specific mor- 
phogenesis when said composition is disposed at a non-skeletal tissue locus in vivo , said matrix having compo- 
nents specific for said tissue locus. 



30 2. The composition of claim 1, wherein said matrix is biodegradable and/or porous (e.g. being derived from organ- 

specific tissue or comprising collagen or cell attachment factors selected from glycosaminoglycans and proteogly- 
can). 

3. A composition comprising a morphogen in association with a matrix derived from dehydrated non-skeletal tissue. 

35 

4. The composition of claim 1 , 2 or 3 wherein said matrix is derived from liver tissue, or from demineralised and 
guanidine-extracted dentin, for example for use in dentinogenesis, liver regeneration, dentin regeneration or the 
repair of damaged liver or dentin tissue (e.g. arising from disease or mechanical injury). 

40 5 . The composition of any one of claims 1 to 4 for: 

(a) increasing a renewing progenitor cell population in a mammal; 

(b) inducing non-skeletal tissue morphogenesis in a mammal; 

(c) use in a method comprising the steps of providing progenitor cells disposed on said acellular matrix to a 

45 non-skeletal tissue locus in vivo to thereby induce non-skeletal tissue-specific differentiation and proliferation 

within said locus; 

(d) maintaining or restoring the phenotypic expression of differentiated mammalian cells {e.g. cells which have 
become (or are at risk of becoming) senescent or quiescent); 

(f) inducing the proliferation of mammalian progenitor cells (e.g. ex vivo), 
so 

6. Use of a morphogen (or the composition of any one of claims 1-5) for the manufacture of a medicament for pro- 
moting non-skeletal tissue regeneration. 

7. The use of claim 6 wherein the tissue is tissue other than bone and cartilage, for example: 

55 • * 

(a) lung tissue damaged by emphysema; 

(b) cirrhotic liver or kidney tissue; 

(c) damaged heart and/or blood vessel tissue (e.g. arising from cardiomyopathies, atherothrombotic or cardi- 
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oembolic strokes); 

(d) damaged stomach tissue arising from ulceration; 

(e) damaged neural tissue (e.g. arising from physical injury or degenerative diseases); 

(f) damaged dentin tissue (e.g. arising from disease or mechanical injury). 

8. The use of claim 7 (e) wherein the degenerative disease is Alzheimer's disease, multiple sclerosis or stroke. 

9. Use of a morphogen for the manufacture of a medicament for: 

(a) stimulating redifferentiation of cells in vivo (e.g. to reduce or substantially inhibit tumorigenesis and/or 
growth of neoplasms); 

(b) cancer therapy (e.g. via the inhibition of neoplasm growth); 

(c) stimulating the redifferentiation of committed cells in vivo\ 

(d) inducing the de- or redifferentiation of neoplasms. 

10. Use of a morphogen for the manufacture of a medicament for: 

(a) preventing or substantially inhibiting the formation of scar tissue; 

(b) the treatment of disorders arising from a loss or reduction of a renewable cell population (e.g. blood dis- 
orders and disorders arising from lost or impaired immune functions); 

(c) the treatment of disorders arising from cell (e.g. liver cells) senescence or quiescence (e.g. osteoporosis) 
via stimulation of said cells in vivo\ 

(d) use in gene therapy to stimulate the growth of quiescent cells and thereby enhance incorporation of exog- 
enous DNA: 

(e) detecting tissue dysfunction by monitoring levels of endogenous morphogens (e.g. to detect changes in 
tissue or organ stasis); 

(f) stimulating the proliferation of differentiated cells; 

(g) promoting tissue stasis; 

(h) dentinogenesis or dentin regeneration; 

(i) liver tissue formation; 

(j) skin tissue regeneration; 

(k) the healing of gastrointestinal ulcers; 

(l) non-skeletal tissue replacement in reconstructive surgery; 

(m) the treatment of degenerative nerve disease; 

(n) inducing enervation of a tissue; 

(o) use as a neuroprotectant; 

(p) use in CNS regeneration; 

(q) inducing tissue morphogenesis in mammalian tissue, which tissue is not bone or cartilage; 

(r) stimulating cells ex vivo followed by disposing said stimulated cells at a tissue-specific locus to induce tissue 
morphogenesis; 

(s) use in a method comprising the steps of isolating progenitor cells from bone marrow, stimulating said cells 
ex vivo to induce proliferation and returning said cells to said bone marrow; 

(t) use in a method comprising the steps of isolating biocompatible cells from a culture cell line, stimulating 
said cells ex vivo to induce proliferation and implanting said cells; 

(u) induce mitogenic activity of a progenitor cell population in vivo (e.g. using a morphogen adapted for systemic 
administration, implantation or injection); 

(v) increasing a selected population of hemopoietic stem cells (e.g. by extracting cells from blood by perfusion 
and stimulating said cells ex vivo to obtain stimulated cells suitable for being returned to the blood); 

(w) increasing a renewing progenitor cell population in a mammal; 

(x) inducing non-skeletal tissue morphogenesis in a mammal; 

(y) use in a method comprising the steps of providing progenitor cells disposed on said acellular matrix to a 
non-skeletal tissue locus in vivo to thereby induce non-skeletal tissue-specific differentiation and proliferation 
within said locus; 

(z) inducing non-skeletal replacement tissue at a non-skeletal tissue locus in a mammal; 

(aa) maintaining or restoring the phenotypic expression of differentiated mammalian cells (e.g. liver cells), 
which have for example become (or are at risk of becoming) senescent or quiescent); 

(ab) inducing the proliferation of mammalian progenitor cells (e.g. ex vivo), with the proviso that the cells 
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in (b), (c), (f), (r), (s), (t), (u), (w), and (ab) are cells of non-skeletal tissue. 

11 . The composition or use of any one of the preceding claims wherein the morphogen comprises a dimeric protein 
that induces tissue-specific morphogenesis in said mammal and comprises a pair of folded polypeptides, the amino 
acid sequence of each of which comprises a sequence sharing at least 70%, e.g. at least 80%, homology with 
residues 38-139 of hOPI (SEQ ID NO. 5). 

12 . The composition or use of claim 11 wherein said amino acid sequence has greater than 60%, e.g. greater than 
65%, amino acid identity with residues 43-139 of hOPt (SEQ ID NO. 5). 

13. The composition or use of claim 12 wherein said amino acid sequence is selected from the group consisting of 
residues 38-139 of hOPI (SEQ ID NO. 5); mOPI (SEQ ID NO. 6); hOP2 (SEQ ID NO. 7) or mOP2 (SEQ ID NO. 
8) or is a naturally-occurring or biosynthetic variant of a sequence selected from said group. 

14 . The composition or use of any one of claims 11-13 wherein said amino acid sequence is selected from the group 
consisting of SEQ ID NOs 9, 1 0, 1 1 , 1 2, 1 3 and 1 4, or a naturally-occurring or biosynthetic variant of said sequence. 

15 . The composition or use of any one of claims 11-14 wherein the sequence of said morphogen polypeptides each 
further comprise an amino acid sequence selected from residues 1-37 of the group consisting of SEQ ID NOs 5, 
6, 7 and 8, or a naturally-occurring or biosynthetic variant thereof, such that said sequences comprise a sequence 
selected from the sequences of mature hOPI , mOPI , hOP2 or naturally-occurring or biosynthetic variants thereof. 

16. A method of detecting non-skeletal tissue dysfunction by monitoring levels of endogenous morphogens (e.g. to 
detect changes in tissue or organ stasis). 

17 . An ex vivo method of: 

(a) increasing a population of mammalian progenitor cells; or 

(b) maintaining the phenotypic expression of differentiated mammalian cells (e.g. cells at risk of becoming 
senescent or quiescent); comprising the step of contacting said cells ex vivo with a morphogen at a concen- 
tration and for a time sufficient such that said progenitor cells are stimulated to proliferate or to express (or re- 
express) their phenotype, respectively, wherein said cells are cells of non-skeletal tissue. 



Patentanspruche 

1. Zusammensetzung umfassend: (1) Eine biokompatible azellulare Matrix, die ein Haftsubstrat fur Saugetier-Vor- 
lauferzellen bereitstellt und im wesentlichen frei von Substanzen ist, die die Morphogenese hemmen; und (b) ein 
an dieser Matrix in einer zur Induzierung der Entwicklungskaskade einer gewebsspezifischen Morphogenese sor- 
biertes Morphogen, wobei die Zusammensetzung in vivo an einem nicht-Skelett-Gewebsort angeordnet wird, wo- 
bei die Matrix Bestandteile aufweist, die fur diesen Gewebs-Ort spezifisch sind. 

2. Zusammensetzung nach Anspruch 1 , bei der die Matrix biologisch abbaubar und/oder poros ist (die beispielsweise 
aus organspezifischem Gewebe stammt Oder Kollagen Oder Zeilanlagerungsfaktoren umfaBt, die aus Glykosami- 
noglykanen und Proteoglykanen ausgewahlt sind). 

3. Zusammensetzung, die ein Morphogen in Verbindung mit einer Matrix umfaBt, die aus dehydriertem nicht-Skelett- 
Gewebe gewonnen wird. 

4 . Zusammensetzung nach Anspruch 1 , 2 Oder 3, bei der die Matrix aus Lebergewebe Oder aus demineralisiertem 
und Guanidinextrahiertem Dentin gewonnen wird, beispielsweise zur Verwendung in der Dentinogenese, der Le- 
berregeneration, der Dentin-regeneration Oder der Reparatur von beschadigtem Leber- Oder Dentingewebe (wobei 
die Schadigung beispielsweise durch Krankheit Oder mechanische Verletzung entstanden ist). 

5. Zusammensetzung nach einem der Anspruche 1 - 4 fur: 

(a) die Vermehrung einer sich erneuernden Vorlauferzell-Population in einem Saugetier; 
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(b) das tnduzieren einer nicht-Skelett-Gewebsmorphogenese in einem Saugetier; 

(c) die Verwendung in einem Verfahren, umfassend die Schritte, auf der azellularen Matrix angeordnete Vor- 
lauferzellen in vivo einem nicht-Skelett-Gewebs-Ort zu verabreichen, urn dadurch eine nicht-Skelett-gewebs- 

5 spezifische Differenzierung und Proliferation innerhalb des Ortes zu induzieren; 

(d) die Aufrechterhaltung Oder Wiederherstellung der phanotypischen Expression differenzierter Saugetier- 
zellen (beispielsweise Zeilen, die alt sind Oder ruhen (oder einem Risiko ausgesetzt sind, dieses zu werden)); 

10 (f) das Induzieren der Proliferation von Saugetier-Vorlauferzellen (beispielsweise ex vivo). 

6. Verwendung eines Morphogens (Oder der Zusammensetzung nach einem der Anspruche 1 - 5) zur Herstellung 
eines Medikamentes zur Unterstutzung einer nicht-Skelett-Gewebs-Regeneration. 

15 7 * Verwendung nach Anspruch 6, wobei das Gewebe ein anderes Gewebe als Knochen- und Knorpel-Gewebe ist, 

beispielsweise: 

(a) Lungengewebe, das durch ein Emphysem geschadigt ist; 

20 (b) zirrhotisches Leber- Oder Nierengewebe; 

(c) geschadigtes Herz- und/oder BlutgefaB-Gewebe (das beispielsweise aus Kardiomyopathien, arthero- 
thrombotischem Oder kardioembolischem Schlaganfali entstanden sind); 

25 (d) geschadigtes Magengewebe, das aus einer Ulzerierung entstanden ist; 

(e) geschadigtes Nervengewebe (das beispielsweise aus einer korperlichen Verletzung oder degenerativen 
Erkrankungen entstanden ist); 

30 (f) beschadigtes Dentingewebe (das beispielsweise aus einer Erkrankung oder mechanischen Verletzung ent- 

standen ist). 

8. Verwendung nach Anspruch 7 (e), bei der die degenerative Erkrankung die Alzheimer'sche Krankheit, multiple 
Sklerose Oder ein Schlaganfali ist. 

35 

9. Verwendung eines Morphogens zur Herstellung eines Medikamentes fur: 

(a) die Stimulierung der Redifferenzierung von Zeilen in vivo (beispielsweise zur Reduzierung oder zur im 
wesentlichen Hemmung der Tumorgenese und/oder des Wachstums von Neubildungen) ; 

40 

(b) die Krebstherapie (beispielsweise uber Hemmung des Neubiidungs-Wachstums); 

(c) die Stimulierung der Redifferenzierung von determinierten Zeilen in vivo; 

45 (d) das Induzieren der De- oder Redifferenzierung von Neubildungen. 

10. Verwendung eines Morphogens zur Herstellung eines Medikamentes fur: 

(a) die Vermeidung Oder im wesentlichen Hemmung der Bildung von Narbengewebe; 

50 

(b) die Behandlung von Storungen, die sich aus einem Verlust oder einer Verminderung einer erneuerbaren 
Zellpopulation ergeben (beispielsweise Blutstorungen und Storungen, die sich aus verlorenen oder ver- 
schlechterten Immunfunktionen ergeben); 

55 ( c ) die Behandlung von Storungen, die aus Zellalterung (beispielsweise von Leberzellen) oder Ruhezustand 

(beispielsweise Osteoporose) entstehen, uber die Stimulierung der Zeilen in vivo; 

(d) die Verwendung in der Gentherapie zur Stimulierung des Wachstums von ruhenden Zeilen und dadurch 
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zur Steigerung des Einbaus von exogener DNA; . 

(e) den Nachweis der Gewebs-Dysfunktion durch Beobachten der Spiegel von endogenen Morphogenen (bei- 
spielsweise zum Nachweis von Veranderungen der Gewebs- Oder Organstasis); 

(f) das Stimulieren der Proliferation von differenzierten Zellen; 

(g) die Forderung der Gewebs-Stasis; 

(h) die Dentinogenese Oder Dentin-Regeneration; 

(i) die Lebergewebsbildung; 

(j) die Hautgewebs-Regeneration; 

(k) die Heilung von gastrointestinalen Ulzera; 

(l) den nicht-Skelett-Gewebsersatz in der plastischen Chirurgie; 

(m) die Behandlung einer degenerativen Nervenerkrankung; 

(n) das Induzieren der Innervation eines Gewebes; 

(o) die Verwendung als Nervenschutzmittel; 

(p) die Verwendung in der ZNS-Regeneration; 

(q) das Induzieren der Gewebsmorphogenese in Saugetiergewebe, wobei das Gewebe nicht Knochen Oder 
Knorpel-Gewebe ist; 

(r) das ex vivo-Stimulieren von Zellen gefolgt von dem Anordnen der stimulierten Zellen in einem gewebsspe- 
zifischen Ort, zum Induzieren einer Gewebsmorphogenese; 

(s) die Verwendung in einem Verfahren, umfassend die Schritte, Vorlauferzellen aus Knochenmark zu isolie- 
ren, diese Zellen ex vivo zum Induzieren einer Proliferation zu stimulieren und diese Zellen zu dem Knochen- 
mark zu ubertragen; 

(t) die Verwendung in einem Verfahren, umfassend die Schritte, biokompatible Zellen aus einer Kulturzell- 
Linie zu isolieren, Zellen ex vivo zum Induzieren einer Proliferation zu stimulieren und diese Zellen zu implan- 
tieren; 

(u) das Induzieren einer mitogenen Aktivitat einer Vorlauferzell-Population in vivo (beispielsweise unter Ver- 
wendung eines Morphogens, das zur systemischen Verabreichung, Implantation Oder Injektion geeignet ist); 

(v) die Vermehrung einer ausgewahlten Population von hamatopoetischen Stammzellen (beispielsweise durch 
Extrahieren von Zellen aus dem Blut durch Perfusion und Stimulieren der Zellen ex vivo zur Gewinnung sti- 
mulierter Zellen, die dazu geeignet sind, wieder an das Blut zuruckgegeben zu werden) ; 

(w) die Vermehrung einer sich erneuernden Vorlauferzell-Population in einem Saugetier; 

(x) das Induzieren einer nicht-Skelett-Gewebsmorphogenese in einem Saugetier; 

(y) die Verwendung in einem Verfahren, umfassend die Schritte, auf der azellularen Matrix angeordnete Vor- 
lauferzellen einem nicht-Skelett-Gewebs-Ort in vivo zu verabreichen, urn dadurch eine nicht-Skelett-gewebs- 
spezifische Differenzierung und Proliferation innerhalb des Ortes zu induzieren; 

(z) das Induzieren eines nicht-Skelett-Ersatzgewebes in einem nicht-Skelett-Gewebs-Ort in einem Saugetier; 
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(aa) die Aufrechterhaltung Oder Wiederherstellung der phanotypischen Expression von differenzierten 
Saugetierzellen (beispielsweise Leberzellen), die beispielsweise ait Oder im Ruhezustand sind (oder ei- 
nem Risiko ausgesetzt sind, dieses zu werden) ; 

(ab) das Induzieren der Proliferation von Saugetier-Vorlauferzellen (beispielsweise ex vivo), unter dem 
Vorbehalt, daB die Zellen in (b), (c), (f), (r), (s), (t) , (u) , (w) und (ab) Zellen aus nicht-Skelett-Gewebe sind. 

11. Zusammensetzung Oder Verwendung nach einem der vorhergehenden Anspruche, bei der das Morphogen ein 
dimeres Morphogen umfaBt, das eine gewebsspezifische Morphogenese bei dem Saugetier induziert und das 
zwei gefaltete Polypeptide umfaBt, wobei deren Aminosauresequenz jeweils eine Sequenz umfaBt, die zumindest 
70%, beispielsweise zumindest 80% Homologie mit den Resten 38 bis 139 von hOPI (Sequenz ID Nr. 5) teilt. 

1 2. Zusammensetzung Oder Verwendung nach Anspruch 1 1 , bei der die Aminosauresequenz mehr als 60%, beispiels- 
weise mehr als 65%, Aminosaureidentitat mit den Resten 43 bis 139 von hOPI (Sequenz ID Nr. 5) aufweist. 

13. Zusammensetzung Oder Verwendung nach Anspruch 12, bei der die Aminosauresequenz aus der Gruppe aus- 
gewahlt ist, die aus den Resten 38 bis 1 39 von hOPI (Sequenz ID Nr. 5); mOPI (Sequenz ID Nr. 6); hOP2 (Sequenz 
ID Nr. 7) Oder mOP2 (Sequenz ID Nr. 8) ausgewahlt ist Oder eine naturlich vorkommende Oder biosynthetische 
Variante einer Sequenz ist, die aus dieser Gruppe ausgewahlt ist. 

14. Zusammensetzung oder Verwendung nach einem der Anspruche 11 - 13, wobei die Aminosauresequenz aus der 
Gruppe ausgewahlt ist, die aus den Sequenz ID Nr. 9, 10, 11, 12, 13 und 14 ausgewahlt ist Oder eine naturlich 
vorkommende Oder biosynthetische Variante dieser Sequenz ist. 

15. Zusammensetzung oder Verwendung nach einem der Anspruche 11-14, wobei die Sequenz der Morphogen- 
Polypeptide jeweils weiterhin eine Aminosauresequenz umfassen, die aus den Resten 1 bis 37 der Gruppe aus- 
gewahlt ist, die aus den Sequenz ID Nr. 5, 6, 7 und 8, oder eine naturlich vorkommenden Oder biosynthetischen 
Variante hiervon bestehen, derartig, daB die Sequenzen eine Sequenz umfassen, die aus den Sequenzen von 
reifem hOPI, mOPI, hOP2 oder naturlich vorkommenden Oder biosynthetischen Varianten hiervon ausgewahlt 
sind. 

1 6. Verfahren zum Nachweis einer nicht-Skelett-Gewebs-Dysfunktion durch Beobachtung der Spiegel endogener Mor- 
phogene (beispielsweise zum Nachweis von Veranderungen in der Gewebsoder Organstasis). 

17. Ex vivo- Verfahren der: 

(a) Vermehrung einer Population von Saugetier-Vorlauferzellen; oder 

(b) Aufrechterhaltung der phanotypischen Expression vori differenzierten Saugetierzellen (beispielsweise Zel- 
len, die einem Risiko ausgesetzt sind, alt Oder ruhend zu werden); 

Umfassend den Schritt, die Zellen ex vivo mit einem Morphogen in einer Konzentration und fur eine ausreichende 
Zeit lang in Beruhrung zu bringen, derartig, daB die Vorlauferzellen zur Poliferation Oder zum Exprimieren (oder 
wieder exprimieren) ihres Phanotyps stimuliert werden, wobei die Zellen Zellen aus nicht-Skelett-Gewebe sind. 



Revendications 

1. Composition comprenant: (1 ) une matrice acellulaire biocompatible qui fournit un substrat d'ancrage pour les cel- 
lules prog^nitrices de mammifere et est essentiellement libre de substances qui inhibent la morphogenese; et (b) 
un morphogene adsorb^ sur cette matrice k une concentration suffisante pour induire la cascade du d£veloppe- 
ment de la morphogenese specifique d’un tissu lorsqu'on introduit cette composition in vivo dans un locus d’un 
tissu non squelettique, cette matrice possedant les composants specifiques pour ce locus de tissu. 

2. La composition selon la revendication 1, dans laquelle cette matrice est biodegradable et/ou poreuse (p.ex. en 
etant derivee d’un tissu specifique d'un organe ou comprenant des facteurs de fixation aux cellules ou au collagfene 
choisis parmi les glycosaminoglycanes et les proteoglycanes). 
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3. Composition comprenant un morphogene en association avec une matrice issue d'un tissu non squelettique de- 
hydrate . 

4. La composition selon la revendication 1 , 2 ou 3 dans laquelle cette matrice est issue d'un tissu du foie, ou d'une 
dentine d§min6ralis6e et ou I'on extrait de la guanidine, p.ex. pour une utilisation en dentinogenese, regeneration 
du foie, regeneration de la dentine ou reparation de tissus endommages de la dentine ou du foie (p.ex. provenant 
d’une maladie ou d'une tesion rrtecanique). 

5. La composition selon I'une quelconque des revendications 1 k 4 pour: 

(a) accroTtre le renouvellement d'une population de cellules prog£nitrices chez un mammitere; 

(b) induire la morphog£nese d'un tissu non squelettique chez un mammitere; 

(c) utiliser dans une methode comprenant les Stapes de fournir des cellules progSnitrices disposSes sur cette 
matrice acellulaire sur un locus in vivo d'un tissu non squelettique pour induire par consequent une proliferation 
et difterenciation spScifique d'un tissu non squelettique k I'interieur de ce locus; 

(e) conserver ou restaurer I'expression phSnotypique des cellules difterenctees de mammiteres (p.ex. les cel- 
lules qui sont devenues (ou risquent de devenir) sSnescentes ou quiescentes); 

(f) induire la proliferation de cellules progSnitrices de mammiteres (p.ex. ex vivo). 

6. Utilisation d'un morphogene (ou de la composition d'une quelconque des revendications 1 k 5) pour la fabrication 
d'un medicament pour favoriser la regeneration d'un tissu non squelettique. 

7. Utilisation selon la revendication 6 dans laquelle le tissu est un tissu autre qu’un os et un cartilage, p.ex.: 

(a) un tissu d'un poumon endommage par un emphyseme pulmonaire; 

(b) un tissu de foie ou de rein endommage par une cirrhose; 

(c) un tissu endommage d'un vaisseau sanguin et/ou d'un vaisseau coronaire (p.ex. provenant d’attaques 
cardiomyopatiques, atherothrombotiques ou cardioemboliques); 

(d) un tissu endommage d'un estomac provenant d’un ulcere; 

(e) un tissu neuronal endommage (p.ex. provenant d'un lesion physique ou de maladie degenerative); 

(f) un tissu endommage de la dentine (p.ex. provenant d'une maladie ou d'une tesion mecanique). 

8. Utilisation selon la revendication 7 (e) dans laquelle la maladie degenerative est la maladie d'Alzheimer, une scle- 
rose multiple ou une attaque. 

9. Utilisation d'un morphogene pour la fabrication d'un medicament pour: 

(a) stimuler in vivo la redifterenciation des cellules (p.ex. pour diminuer ou inhiber essentiellement la tumori- 
genese et/ou la croissance des neoplasmes); 

(b) la therapie contre le cancer (p.ex. via I'inhibition de la croissance des neoplasmes); 

(c) stimuler la redifterenciation des cellules differences in vivo; 

(d) induire la de- ou redifterenciation des neoplasmes. 

10. Utilisation d'un morphogene pour la fabrication d’un medicament pour: 

(a) emp§cher ou inhiber essentiellement la formation d'une tesion tissulaire; 

(b) traiter les troubles provenant d'une perte ou d'une reduction d'une population de cellules renouvelables 
(p.ex. les troubles sanguins et les troubles provenant d’une perte ou d'un affaiblissement des fonctions immu- 
nitaires); 

(c) traiter les troubles provenant de la senescence ou de la quiescence des cellules (p.ex. cellules du foie) (p. 
ex. osteoporose) via une stimulation de ces cellules in vivo; 

(d) utilisation en therapie g6nique pour stimuler la croissance des cellules quiescentes et par consequent 
augmenter I'incorporation d’ADN exogene; 

(e) detecter les dysfonctionnements de tissu en controlant le niveau des morphogenes endogenes (p.ex. d6- 
tecter les changements de stase dans les tissus ou organes); 

(0 stimuler la proliferation des cellules differences; 

(g) ameiiorer la stase des tissus; 

(h) reg6nerer la dentine ou la dentinogenese; 
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(i) former un tissu hepatique; 

(j) r6g6n6rer un tissu de la peau; 

(k) gu^rir les ulc&res gastro-intestinaux; . 

(l) remplacer les tissus non squelettiques grace k une chirurgie reconstructive; 

(m) traiter les maladies nerveuses d£g£n6ratives; 

(n) induire Innervation d'un tissu; 

(o) utiliser sous; forme d'un neuroprotecteur; 

(p) utiliser dans la regeneration CNS; 

(q) induire la morphogenese d'un tissu dans un tissu d’un mammifere, ce tissu n'etant pas un os ou un cartilage; 

(r) stimuler les cellules ex vivo suivi par I'introduction de ces cellules stimuiees k un locus specifique d'un tissu 
pour induire la morphogenese d'un tissu; 

(s) utiliser dans une methode comprenant les etapes d'isolement des cellules progenitrices k partir de la moelle 
osseuse, stimuler ces cellules ex vivo pour induire la proliferation et retourner ces cellules vers cette moelle 
osseuse; 

(t) utiliser dans une methode comprenant les etapes d'isolement de cellules biocompatibles a partir d'une 
lignee de culture cellulaire, stimulant ces cellules ex vivo pour induire la proliferation et I'implantation de ces 
cellules; 

(u) induire I'activite mitogene d'une population de cellules progenitrices in vivo (p.ex. en utilisant un morpho- 
gene adapte pour une administration systemique, une implantation ou une injection) ; 

(v) augmenter une population choisie de cellules souches hemopoietiques (p.ex. par extraction des cellules 
k partir du sang par perfusion pour obtenir des cellules stimuiees appropriees pour etre retournees et stimuler 
ces cellules ex vivo vers le sang); 

(w) augmenter une population de cellules progenitrices k renouveler chez un mammifere; 

(x) induire une morphogenese d'un tissu non squelettique chez un mammifere; 

(y) utilisation dans une methode comprenant les etapes de fournir des cellules progenitrices disposees sur 
cette matrice acellulaire sur un locus d'un tissu non squelettique in vivo pour induire par consequent une 
proliferation et une differenciation specifique d'un tissu non squelettique k I'interieur de ce locus; 

(a’) induire le remplacement d’un tissu non squelettique k un locus d'un tissu non squelettique chez un 
mammifere; 

(aa) conserver ou restaurer I'expression ph6notypique des cellules differences de mammiferes (les cel- 
lules du foie), qui sont devenues p.ex. (ou risquent de devenir) senescentes ou quiescentes); 

(ab) induire la proliferation de cellules progenitrices de mammiferes (p.ex. ex vivo), a condition que ces 
cellules dans (b), (c), (f), (r), (s), (t), (u), (w), et (ab) sont des cellules d'un tissu non squelettique. 

11. Composition ou utilisation selon I'une quelconque des revendications pr6cedentes dans laquelle le morphogene 
comprend une proteine dimere qui induit une morphogenese specifique d'un tissu chez ce mammifere etcomprend 
une paire de polypeptides plies, dont la sequence d'acides amines de chacun comprend une sequence partageant 
au moins 70%, p.ex. au moins 80%, d'homologie avec des residus 38-139 de hOPI (SEQ ID NO. 5). 

12. Composition ou utilisation selon la revendication 11 dans laquelle cette sequence d'acides amines possede une 
identite d'acides amines superieure k 60%, p.ex. superieure k 65% avec des residus 43-139 de hOPI (SEQ ID 
NO. 5). 

13. Composition ou utilisation selon la revendication 12 dans laquelle on choisit cette sequence d'acides amines a 
partir du groupe comprenant les residus 38-139 de hOPI (SEQ ID NO. 5); mOPI (SEQ ID NO. 6); hOP2 (SEQ 
ID NO. 7) ou mOP2 (SEQ ID NO. 8) ou elle est un variant biosynthetique ou naturel d'une sequence choisie dans 
ce groupe. 

1 4. Composition ou utilisation selon I'une quelconque des revendications 11-13 dans laquelle on choisit cette sequence 
d'acides amines k partir du groupe comprenant les sequences ID No. 9, 10, 11, 12, 13 et 14, ou un variant bio- 
synthetique ou naturel de cette sequence. 



15. Composition ou utilisation selon I'une quelconque des revendications 11-14 dans laquelle chaque sequence sup- 
p!6mentaire de ces polypeptides de morphogenes comprend une sequence d'acides amines choisie a partir des 
residus 1-37 du groupe comprenant les sequences ID No. 5, 6, 7 et 8, ou un variant biosynthetique ou naturel de 
celui-ci, telles que ces sequences comprennent une sequence choisie k partir des sequences de variants biosyn- 
thetiques ou se produisant naturellement ou de hOPI , mOPI , hOP2 matures de celui-ci. 
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16. Methode de detection d'un dysfonctionnement d'un tissu non squelettique par controle des.niveaux de morpho- 
genes endogenes (p.ex. pour detecter les changements de stase dans les tissus ou les organes). 

17. M6thode ex vivo pour: 

5 

(a) accroltre une population de cellules progenitrices de mammifere; ou 

(b) conserver ('expression phSnotypique des cellules difterenciees de mammifere (p.ex. les cellules risquant 
de devenir s^nescentes ou quiescentes); 

10 comprenant retape de mettre ces cellules ex vivo en contact avec un morphogfcne & une concentration et 

une periode suffisante pour que ces cellules progenitrices soient stimulees pour proliterer et pour exprimer (ou re- 
exprimer) leur phenotype, respectivement, ou ces cellules sont des cellules d'un tissu non squelettique. 
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Fig. 2 
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Fig. 4B 
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Fig. 6B 
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Fig. 6C 




Fig. 6D 
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Fig. 8C 




